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Preface to the Second Edition 


There are three changes in the second edition. First, with the help of readers 
and colleagues -thanks to all— I have corrected typographical errors and 
made minor changes in substance and style. Second, I have added a few more 
Exercises, especially at the end of Chapter 4. Third, I have appended a section 
on Differential Geometry, the essential mathematical tool in the study of 
two-dimensional structural shells and four-dimensional general relativity. 


Jamks G. Simmonds 



Preface to the First Edition 


When I was an undergraduate, working as a co-op student at North Ameri- 
can Aviation, I tried to learn something about tensors. In the Aeronautical 
Engineering Department at MIT, I had just finished an introductory course 
in classical mechanics that so impressed me that to this day I cannot watch a 
plane in flight especially in a turn — without imaging it bristling with vec- 
tors. Near the end of the course the professor showed that, if an airplane is 
treated as a rigid body, there arises a mysterious collection of rather simple- 
looking integrals called the components of the moment of inertia tensor. 
Tensor what power those two syllables seemed to resonate. I had heard the 
word once before, in an aside by a graduate instructor to the cognoscenti in 
the front row of a course in strength of materials. “What the book calls stress 
is actually a tensor. . . .” 

With my interest twice piqued and with time off from fighting the brush- 
fires of a demanding curriculum, I was ready for my first serious effort at 
self-instruction. In Los Angeles, after several tries, I found a store with a book 
on tensor analysis. In my mind I had rehearsed the scene in which a graduate 
student or professor, spying me there, would shout, “You’re an under- 
graduate What are you doing looking at a book on tensors?” But luck was 
mine: the book had a plain brown dust jacket. Alone in my room, I turned 
immediately to the definition of a tensor: “A 2nd order tensor is a collection 
of n 2 objects that transform according to the rule . . .” and thence followed an 
inscrutable collection of superscripts, subscripts, overbars, and partial deriv- 
atives. A pedagogical disaster! Where was the connection with those beauti- 
ful, simple, boldfaced symbols, those arrows that I could visualize so well? 

1 was not to find out until after graduate school. But it is my hope that, 
with this book, you, as an undergraduate, may sail beyond that bar on which 
I once foundered. You will find that I take nearly three chapters to prepare 



Preface to the First Edition 


you for the shock of the tensor transformation formulas. I don't try to hide 
them they’re the only equations in the book that are boxed. But long be- 
fore, about halfway through Chapter 1, I tell you what a 2nd order tensor 
really is — a linear operator that sends vectors into vectors. If you apply the 
stress tensor to the unit normal to a plane through a point in a body, then 
out comes the stress vector, the force/area acting across the plane at that 
point. (That the stress vector is linear in the unit normal, i.e., that a stress 
tensor even exists, is a gift of nature; nonlinearity is more often the rule.) The 
subsequent “debauche dcs indices” that follows this tidy definition of a 2nd 
order tensor is the result of exposing the gears of a machine for grinding out 
the workings of a tensor. Abolish the machine and there is no hope of pro- 
ducing numerical results except in the simplest of cases. 

This book falls into halves: Algebra and Calculus. The first half of the first 
half (Chapter 1 ) emphasizes concepts. Here, I have made a special effort to 
relate the mathematical and physical notions of a vector. I acknowledge my 
debt to Hoffman’s intriguing little book. About Vectors (Dover, 1975). (But 
there are points where we differ - I disagree with his contention that vectors 
cannot represent finite rotations.) Chapter 2 deals mostly with the index 
apparatus necessary to represent and manipulate vectors and tensors in gen- 
eral bases. Chapter 3, through the vehicle of Newton’s law of motion, intro- 
duces moving frames and the Christoffel symbols. To help keep the basic 
kinematic ideas and their tensor generalizations in mind simultaneously, l list 
a number of equations in dual form, a device that I have found successful in 
the classroom. The last chapter starts with a homely example of the gradient 
and builds to the covariant derivative. Throughout this chapter there are 
applications to continuum mechanics. Although the basic equations (exclud- 
ing electricity and magnetism) were known by the 1850’s, it was only under 
the spur of general relativity that tensor analysis began to diffuse into this 
older field. (In my own specialty, shell theory, tensor analysis did not appear 
until the early 1940’s, in the Soviet literature, even though the underlying 
theory of surfaces and their tensor description had been central to the under- 
standing of general relativity.) 

I have provided no systematic lists of grad, div, curl, etc. in various coor- 
dinate systems. Such useful information can be found in Magnus, Oberhet- 
tinger, and Soni, Formulas and Theorems for the Special Functions of Mathe- 
matical Physics, 3rd enlarged edition. Chapter XII, Springer-Verlag 1966; or 
in Gradshteyn and Ryzhik, Tables of Integrals, Series and Products, 4th edi- 
tion, corrected and enlarged. Academic Press, 1980. 

It is a happy thought that much of the drudgery involved in expanding 
equations and verifying solutions in specific coordinate systems can now be 
done by computers, programmed to do symbol manipulation. The interested 
reader should consult “Computer Symbolic Math in Physics Education,” by 
D. R. Stoutemyer, Am. J. Phys., vol. 49 (1981), pp. 85-88, or “A Review of 
Algebraic Computing in General Relativity,” by R. A. d’lnverno, Chapter 16 
of General Relativity and Gravitation, vol. 1, ed. A. Held, Plenum Press, N.Y. 
and London, 1980. 



I am pleased to acknowledge the help of three friends: Mark Duva, a 
former student, who, in his gracious but profound way, let me get away with 
nothing in class; Bruce Chartres, who let me filter much of this book through 
his fine mind; and Ernst Soudek, who, though not a native speaker, tuned the 
final manuscript with his keen ear for English. 

Finally, my thanks to Carolyn Duprey and Ruth Nissley, who typed the 
original manuscript, and then with patience and good humor, retyped what 
must have seemed to be hundreds of petty changes. 


James G. Simmonds 
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CHAPTER I 


Introduction: Vectors 
and Tensors 


The magic of this theory will hardly fail to impose itself on anybody who has truly 
understood it; it represents a genuine triumph of the method of absolute differen- 
tial calculus, founded by Gauss. Riemann, Christoffel, Ricci and Levi-Civita .' 

This little book is about tensor analysis, as Einstein’s philosophers’ stone, 
the absolute differential calculus, is called nowadays. I have written it, 
though, with an eye not toward general relativity, but to continuum mechan- 
ics, a more modest theory that attempts to predict the gross behavior of “the 
masses of matter we see and use from day to day: air, water, earth, flesh, 
wood, stone, steel, concrete, glass, rubber , . 1 2 

Continuum mechanics is a limiting case of general relativity; yet it is best 
treated on its own merits. Viewed thus, there is a fundamental difference at 
the foundations of the two theories. The geometry of continuum mechanics 
is that of three-dimensional Euclidean space (E 3 for short) and the real line, R. 
The geometry of general relativity is that of a four-dimensional Riemannian 
manifold, (A sphere is a two-dimensional Riemannian manifold.) To those 
who will settle for nothing less than a complete understanding of general 
relativity (and who, therefore, will want to consult Gravitation, by Misner, 
Thorne, and Wheeler), take heart. From the tools that we shall fashion comes 
the gear to scale that pinnacle. And to those content to cultivate the garden 
of continuum mechanics, let me say that, embedded within it, are intrinsically 


1 Albert Einstein, “Contribution to the Theory of General Relativity”, 1915; as quoted and 
translated by C. Lanczos in The Einstein Decade, p. 213. 

2 Truesdcll and Noll, The Non-Linear Field Theories of Mechanics, p. 1. Two outstanding intro- 
ductory texts on continuum mechanics are A First Course in Rational Continuum Mechanics, 2nd 
ed, by Truesdcll and Continuum Mechanics by Chadwick. 
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curved two-dimensional continua, called shells, that in dwarf form exhibit 
nearly all of the mathematical foliage found in full-flowered general relativity. 

In attempting to give mathematical form to the laws of mechanics, we face 
a dichotomy. On the one hand, if physical events and entities are to be 
quantified, then a (reference) frame and a coordinate system within that frame 
must be introduced. 3 On the other hand, as a frame and coordinates are mere 
scaffolding, it should be possible to express the laws of physics in frame- and 
coordinate-frcc form, i.c. in invariant form. Indeed this is the great program 
of general relativity. 

In continuum mechanics, however, there are exceptional frames called 
inertial ; Newton’s Law of motion for a particle— force equals mass times 
acceleration — holds only in such frames. 4 A basic concern of continuum 
mechanics is therefore how laws such as Newton’s change from one frame to 
another. 5 Save for Exercise 4.24, we shall not analyze changes of frame. 
Rather we shall study how, within a fixed frame, the mathematical represen- 
tation of a physical object or law changes when one coordinate system (say 
Cartesian) is replaced by another (say spherical). 

In what follows, I have assumed that you remember some of the plane and 
solid geometry that you once learned and that you have seen a bit of vector 
algebra and calculus. For conciseness, I have omitted a number of details 
and examples that you can find in texts devoted to vectors. At the same 
time I have emphasized several points, especially those concerning the physi- 
cal meaning of vector addition and component representation, that are not 
found in most conventional texts. The exercises at the end of each chapter are 
intended to amplify and to supplement material in the text. 


’ A frame is a mathematical representation of a physical apparatus which assigns to each event 
e in the physical world it a unique place (i.c. point) in E 3 and a unique instant on the real line 
R. 1 like to imagine an idealized, all-seeing stereographic video camera mounted on 3 rigid, 
mutually perpendicular rods. The rods have knife edges that intersect at a point and one of the 
rods carries a scratch to fix a unit of length. The 3 knife edges (indefinitely prolonged) are 
represented by a right-handed Cartesian reference frame Oxyz in E 3 , and one instant (arbitrarily 
chosen) is taken as the origin of the real line. The exposed tape is a physical realization of a 
framing (to use the terminology of Truesdell, op. cit), i.c. a map / from if to E 3 x R. 

A coordinate system in a frame assigns to each place a unique triple of real numbers (u, i>, tv) 
called spatial coordinates and to each instant a unique number l call the time. 

4 Inertial frames are also special, but in a different way, in general relativity where frames are 
coordinate systems! (and physics is geometry). An inertial frame may be introduced in general 
relativity in the same way as a two-dimensional Cartesian coordinate system may be introduced 
in an arbitrarily small neighborhood of a point on a sphere. 

5 To change frames means, for example, to tape the world with a copy of our super camera. If the 
cameras arc in relative motion, then the two exposed tapes / and f will map the same event e 
into different places P and P in E 3 and into different instants T and T on R. Of course, the two 
sets of knife edges are represented by the same frame 0 xyz and the cameras run at the same rate 
This change of frame is a special type of time-dependent map of E 3 x R into itself that preserves 
the distance and elapsed time between two events. When the elapsed time is zero, this transfor- 
mation has the same form as a rigid body motion. Sec Exercise 4.19 and Truesdell. op cit. 
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Three-Dimensional Euclidean Space 

Three-dimensional Euclidean space, E 3 , may be characterized by a set of 
axioms that expresses relationships among primitive, undefined quantities 
called points, lines, etc. 6 These relationships so closely correspond to the 
results of ordinary measurements of distance in the physical world that, until 
the appearance of general relativity, it was thought that Euclidean geometry 
was the kinematic model of the universe. 


Directed Line Segments 

Directed line segments, or arrows, are of fundamental importance in Euclid- 
ean geometry. Logically, an arrow is an ordered pair of points, (A,B). A is 
called the tail of the arrow and B the head. It is customary to represent such 
an arrow typographically as AB, and pictorially as a line segment from A to 
B with an arrow head at B. (To avoid crowding, the arrow head may be 
moved towards the center of the segment). Assigning a length to an arrow or 
multiplying it by a real number (holding the tail fixed) are precisely defined 
operations in E 3 . 

Two arrows are said to be equivalent if one can be brought into coinci- 
dence with the other by a parallel translation. 7 In Fig. 1.1, AB and CD are 
equivalent, but neither AB and EF nor AB and GH are. 

The set of all arrows equivalent to a given arrow is called the (geometric) 
vector of that arrow and is usually denoted by a symbol such as v. A vector 
is an example of an equivalence class and, by convention, a vector is repre- 
sented by any one of its arrows. 

Equivalence classes are more familiar (and more useful) than you may 
realize. Suppose that we wish to carry out, on a computer, exact arithmetic 



Figure 1.1. 


6 This was Hilbert's program: reduce geometry to a branch of logic. No pictures allowed! See. 
for example, the discussion at the end of Eisenhart’s Analytic Geometry. Our approach, however, 
shall be informal and visual. 

7 A definition that makes no sense on a sphere. Why? 
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Figure 1.2. 


operations on rational numbers. Then, for example, f must be read in as the 
ordered pair of integers (2, 3). We test for the equivalence of two ordered pairs 
of integers ( a,b ) and (c,d) stored within the computer by checking to see if 
ad = be. In doing so, we are tacitly using the definition of a rational number 
a/b as the equivalence class of all ordered pairs of integers ( c,d ) such that 
ad — be. 

In practice, it is expedient (and rarely causes problems) to confound a 
“number”, such as two-thirds, with its various representations e.g., 2/3, 4/6, 
etc. Likewise, we shall be using the term “vector” when we mean one of its 
arrows (and vice versa), relying on context for the proper interpretation. Thus 
in Fig. 1.2 we call any one of the equivalent arrows “the vector v." 

The length of a vector v is denoted by |v| and defined to be the length of 
any one of its arrows. The zero vector, 0, is the unique vector having zero 
length. We call the unit vector 

V = v/|v|, v # 0, (1.1) 

the direction of v; 0 has no direction. 

We may choose, arbitrarily , a point 0 in E 3 and call it the origin. The vector 
x (of the arrow) from 0 to a point P is called the position of P. We shall 
sometimes write P(\) as shorthand for “the point with position x.” 


Addition of Two Vectors 

Addition of two vectors u and v may be defined in two equivalent ways. 8 

A. The Head-to-Tail-Rule (Fig. 1.3a). Take any arrow representing u, say 
AB. For this choice there is a unique arrow BC representing v; u + v is 
defined to be the vector of the arrow AC. This definition is convenient if one 
wishes to add a string of vectors (Exercise 1.1), but commutativity is not 
obvious. For reasons of symmetry it may be preferable to use the following. 


“ The equivalence and uniqueness of the two definitions can be proved from the postulates of 
Euclidean geometry. 
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B. The Parallelogram Rule (Fig. 1.3b). Let u and v be represented by any 
two arrows having coincident tails, say AB and AD Then u + v is the vector 
of the arrow AC, where C is the vertex opposite A of the parallelogram 
having AB and AD as co-terminal edges. 

Adding three or more vectors in this way is a bit awkward graphically, but 
u + v + w has the following neat interpretation in three-dimensional space. 
Let u, v, and w be represented by arrows AB, AC, and AD. Then u + v + w is 
the vector of the arrow, with tail at A, lying along the diagonal of the paral- 
lelopiped having AB, AC, and AD as co-terminal edges. See Exercise 1.2. 


Multiplication of a Vector v by a Scalar a 

Multiplication of a vector v by a scalar a is defined in an obvious way: if AB 
is an arrow representing v, then otv is the vector of the arrow otAB, (Recall that 
in Euclidean geometry, we do multiplication with the aid of similar triangles). 

The set of all geometric vectors, together with the operations of addition 
and multiplication by a scalar, form a linear vector space. Other familiar 
examples of linear vector spaces are the set of all polynomials of degree n, the 
set of all solutions of a linear homogeneous differential equation of order n, 
and the set of all m x n matrices together with appropriate definitions of 
addition and multiplication by a scalar. 

Things That Vectors May Represent 

Many physical and kinematic objects — the displacement from one point to 
another, a force acting on a particle, the finite rotation of a rigid body about 
an axis- have direction and magnitude. We may represent these attributes 
by vectors 9 In doing so, we must keep in mind two fundamental points. 

9 A phrase such as “the vector f representing the attributes of a force” is a precise mouthful. For 
platability we may pare it to “the vector f representing a force” or even to “a force f ” When the 
context is clear, the tidbit “f” may suffice. 
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A. Different Types of Objects are Represented by Vectors That Belong to 
Different Vector Spaces. Otherwise we could add, for example, a force to a 
displacement. Nevertheless, for conciseness and clarity, we often place differ- 
ent types of vectors in the same picture, as Fig. 1.4 that shows the position x, 
the velocity v, the acceleration a, and the force f acting on a cannon ball flying 
through the air. 

B. Vector Addition May or May Not Reflect an Attribute of the Objects 
Represented. For displacements, forces, or velocities, there are obvious phys- 
ical analogues of vector addition; for successive finite rotations of a rigid 
body about a fixed point, there is not. We shall say more about vector addi- 
tion later. 10 


Cartesian Coordinates 

Thanks to Descartes, we may characterize three-dimensional Euclidean 
space in algebraic terms as follows. Through the origin 0 draw three mutually 
perpendicular (1) but otherwise arbitrarily chosen lines. Call one of these the 
x-axis and on it place a point / ^ 0. The ray (or half-line) from 0 containing 
/ is called the positive x-axis. 01 is called the unit arrow along the x-axis and 
we denote its vector by e x . Choose one of the remaining lines through 0, call 
it the y-axis, and place on it a point J such that the length of OJ is equal to 
that of 01 . OJ is called the y-unit arrow and we denote its vector by e r The 
remaining line through 0 is called the z-axis and, by arbitrarily adopting the 
right-hand rule, we may place a unique point K on the z-axis* 1 such that the 


10 Physically, there arc differences in how displacements, forces and velocities add: displace- 
ments follow the head-to-tail-rule, forces, at a common point, the parallelogram rule, while 
velocities “add like vectors” only because of the postulates of continuum mechanics concerning 
moving frames. Velocities do not add like vectors in relativity theory. 

1 1 This means that if we curl the fingers of our right hand from 01 to OJ, then our thumb will 
point in the direction of OK. 
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length of OK is equal to that of 01 . OK is the z-unit arrow and e 2 denotes its 
vector. 

Any point P may be represented by an ordered triple of real numbers 
(x,y,z), called the Cartesian coordinates of P. The first number, or x-coordi- 
nate, is the directed (1) distance from the yz-plane to P. Thus x is positive if 
P lies on the same side of the yz-plane as does / and negative if it lies on the 
opposite side; x is zero if P lies in the yz-plane. The second and third coordi- 
nates, y and z, are defined in an analogous way. To indicate that a point P 
has coordinates (x, y, z) we sometimes write P(x, y,z). 

When a vector v is represented by the arrow whose tail is the origin 0, then 
the coordinates of the head of this arrow, say (v x ,v y ,v z ), are called the Carte- 
sian components of v. We indicate this relation by writing v ~ (t> x , v y , d,). Thus, 
in particular, 

e x ~ (1,0,0), e„~ (0,1,0), e 2 ~ (0,0,1) (1.2) 

x~(x,y,z). (1.3) 

With a way of assigning Cartesian components (v x ,v y ,v z ) to a vector v, and 
vice-versa, we may easily deduce the following relations. 

i) |v| = y/v 2 x + v y + v z , (1.4) 12 

by the Pythagorean theorem. 

ii) If a is a real number, then 



ocv ~ (av x ,ctv y ,av z ). 

(1.5) 

iii) If w ~ (w x , vv 

’ y , w 2 ), then 



V ± w ~ (v x ± W x ,V y ± W y ,V z ± VV 2 ). 

(1.6) 

iv) 

V = W O V x = W x , V y = Wj,, v z = w 2 . 

(1.7) 


These relations allow us to represent and to manipulate vectors on a com- 
puter. 


The Dot Product 

The dot product of two vectors u and v, denoted by u v, arises in many 
different physical and geometric contexts. 13 Some authors define the dot 
product by the formula 

12 Here we arc deducing the algebraic properties of E 3 from its geometric ones. To go the other 
way, which is easier in many respects, we define E 3 to be the set of all ordered triples of real 
numbers fx, y,z)such that the distance between any two points (x, , y, , z, ) and (x 2 ) y,,z 2 )is given 
by 

■Jl * t - x 2 ) 2 + Oh - V 2 ) 2 + (*1 - z 2 ) 2 - 

13 There are certain technicalities to be considered when we try to interpret the dot product 
physically. Sec the discussion at the end of the chapter. 



I. Introduction: Vectors and Tensors 



Ivl cos 8 U 
Figure 1.5. 


u ■ V = | U 1 1 V | cos 0, Oi£0 5S7t, (1.8) 

where “0 is the angle between u and v.” This definition seems simple and 
reasonable. But wait. If all we know is how to compute the length of a vector, 
how do we compute 0? We can’t unless we turn (1.8) into the definition of 0. 
Then we have no choice but to come up with a definition for u • v that is 
independent of 0. Here goes. 

If | u 1 1 v | = 0, take u v = 0. Otherwise, consider Fig. 1.5, which may be 
constructed using strictly geometric methods. The double-headed arrow indi- 
cates that 0 is always nonnegativc, regardless of the relative orientation of u 
and v. As shown, v may be expressed as the sum of a vector |v| cos 0 0, parallel 
to u, and a vector v L , 1 to u. Take first the larger of the two right triangles in 
the sketch and then the smaller to obtain, by the Pythagorean theorem, 

|v| 2 = |v| 2 cos 2 0 + |vj 2 , (1.9) 

|v - u | 2 = ( | V | cos 0 | u | ) 2 + |vj 2 

= |v| 2 cos 2 0 - 2|u| |v| cos 0 + | u | 2 4- |vj 2 
= — 2 1 u 1 1 v | cos 0 + |u| 2 + |v| 2 , (1.10) 14 

by (1.9). Comparing (1.8) with (1.10), we are led to the definition 

uv = A(|u| 2 + |v| 2 -|v-u| 2 ). (1.11) 

Note that if |u| | v| =0, then u • v = 0, in agreement with our earlier definition. 
If u ~ v, 

vv = |v| 2 , (1,12) 

Two vectors whose dot product is zero are said to be orthogonal (1). 

To evaluate u • v on a computer, we need a component representation. This 
is easy to find. If u ~ (u x ,u y ,u 2 )and v ~ (v x ,v y ,v x ), then, from (1.4) and (1.1 1), 

»•» = «*«>, + «,»» + “*»«• ( 113 ) 


14 This result is usually called the law of cosines. 
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Suppose that, without changing u or v, we were to introduce another set of 
Cartesian axes. The components of u and v relative to the new axes would, in 
general, be different, but would be right side of (1.13) change? The individual 
terms, being the products of components, would, of course, but their sum 
would not. Why? Because (1.11) defines u v in terms of lengths and lengths, 
in Euclidean geometry, may be defined without reference to coordinate axes. 
We say, therefore, that the dot product is a geometric invariant. 

Another key property of the dot product is that it is distributive with re- 
spect to addition, i.e. 

u (v + w) = u v + u w, Vu, v, w. (1.14) 

To prove (1.14) let w ~ (w x ,Wj„ w z ). Then, by (1.13), 

u (v + w) = u x (v x + w x ) + u y (v y + Wy) + u z (v. + w.) 

= u x v x + u y v y + u z v z + u x w x + u y w y + u z w z . 

= u V + u w. □ 

This proof illustrates an important point: often, geometric facts are proved 
most easily using Cartesian coordinates. Yet, as (1.14) is coordinate-free, 
it must follow directly from (1.11). This is Exercise 1.7. Always try to inter- 
pret formulas both algebraically and geometrically, for one of these 
viewpoints, though awkward for proof, may lead to new insights or suggest 
generalizations. 


Problem 1.1. 

Ifu ~ (1,2,3) and v ~ ( — 3,1, —2), compute u v and the enclosed angle. 


Solution. 

From (1.13) 

u v = (1)( — 3) + (2)(1) + (3) (—2) = -7, 

and from (1.8) 




Hence 


0 = cos 1 U V 0 < 9 < n. 
|u||v| 

|u| = v /l 2 + 2 i + 3 2 = /14 

|v| = V(-3) a + l a + (-2) 2 = V'4 

0 = cos"'(— 7/14) = 120" = 2 ti/ 3 radians. 
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Cartesian Base Vectors 

Given the Cartesian components (v x ,v y ,v z ) of any vector v, (1.6) and (1.5) 
allow us to set 


(t>*.®i.»«) = (t' x , 0, 0) + (0,L' ,0) + (0,0, r 2 ) 

(1.15) 

= mi, 0,0) + m°, i,0) + t? z (o,o, i). 

Recalling (1.2) and (1.6), we infer that v has the unique representation 

v = v x c x ~i~ VyC y -T v z c z . (1.16) 

The set {e x ,e y ,e z } is called the standard Cartesian basis, and its elements, 
the Cartesian base vectors. The Cartesian components of v may be referred to, 
alternatively, as the components of v relative to the basis {e x ,e,,,e 2 }. 


The Interpretation of Vector Addition 

It is essential to emphasize the distinction between vector addition, as a 
reflection of a physical or kinematic attribute, and vector addition, as it is 
used on the right side of (1.16). Suppose, for example, that v represents the 
rotation of a rigid body about a fixed point. Let the direction and magnitude 
of v represent, respectively, the axis and angle of rotation, using the right 
hand rule to insure a unique correspondence. If this rotation is followed by 
another, represented by a vector u, then it is a fact from kinematics that these 
two successive rotations are equivalent to a single rotation that we may 
represent by a vector w. However, in general, v + u / w, i.e., successive finite 
rotations do not add like vectors.' 5 Nevertheless, the additions that appear on 
the right hand side of (116) do make sense if we do not interpret the sum- 
mands individually as rotations. Let me try to illustrate this point with a 
simple analogy. Suppose that there are 20 students and 30 chairs in a class- 
room. For statistical purposes we may speak of of a student per chair” 
even though, physically, there is no such thing as § of a student. The students 
and chairs arc represented by whole numbers and what we did to get a 
statistic was to perform a mathematical operation on these whole numbers. 

Vector addition does mirror a physical attribute when, for example, v in 
(1.16) represents a force. It is an experimental fact that “forces add like vec- 
tors.” See Exercise 1.3. In this case the mathematical decomposition of a 
vector into components happens to have a direct physical interpretation. The 
analogy here might be that of a dining hall in which there are 20 one-gallon 
bottles of milk and 30 pitchers. Not only can we speak of “§ gallons of milk 

1 5 Rotations may also be represented by matrices. If the two successive rotations are represented 
by the matrices V and V, then they are equivalent to a single rotation represented by the matrix 
W = V V. u, v, or w may be identified, respectively, with the single real eigenvector of U. K or W 
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per pitcher,” we can also, physically, put § gallons of milk into each pitcher. 
Here the mathematical operation of division of whole numbers has a physical 
counterpart. 


The Cross Product 

The cross product arises in mechanics when we want to compute the torque 
of a force about a point, in electromagnetics when we want to compute the 
force on a charge moving in a magnetic field, in geometry when we want to 
compute the volume of a parallelopiped or tetrahedron, and in many other 
physical and geometric situations. 16 

The cross product of two vectors u and v is denoted by u x v and defined 
to be the right oriented area of a parallelogram having u and v as co-terminal 
edges. That is, 

|ux v| = |u||v|sin0, O<0<7t (1.17) 

where the direction of u x v is that of the thumb on the right hand when the 
fingers are curled from u to v. Two possibilities are indicated in Fig. 1.6. 
From the definition of the cross product, 

uxv = — vxu. (1.18) 

Further useful properties of the cross product follow from a geometrical 
interpretation of the scalar triple product (u x v)-w. Consider the parallel- 
epiped having u, v, and w as co-terminal edges and let vol (u, v, w) denote its 



16 The remarks made earlier concerning physical interpretations of the dot product apply to the 
cross product as well. See the discussion at the end of this chapter. 
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Figure 1.7. 


volume. By regarding this volume as the limit of the sum of right parallel- 
epipeds of vanishingly small altitude and identical bases, each base parallelo- 
gram having u and v as co-terminal edges, we have 

vol(u,v,w) = |(u x v)-w|. (1.19) 17 

If (u, v, w), in that order, form a right handed system, as shown in Fig. 1.7, 
then, by symmetry, 

voI(u,v,w) = (u x v) -w = (w x u) v = (v x w)u > 0, (1.20) 

i.e. (u x v) w is invariant under cyclic permutation. 

With the aid of (1.20), we shall verify that the cross product is distributive 
with respect to addition : 

ux(v + w) = uxv + uxw. (1-21) 

To do so we use the fact that two vectors are equal if and only if their dot 
products with all vectors c are equal . 18 Thus, 

[u x (v + w)] c = (c x u)(v + w), by (1.20) 

= (c x u)'v + (c x u)-w, by (1.14) 

= (u x v)-c + (u x w) c, by (1.20) 

= (u x v + u x w) c, by (1.14). □ 


Problem 1.2. 

Prove the vector triple product identity 

(uxv)xw = (uw)v-(vw)u (1.22) 

without introducing components. 

1 7 We hardly need calculus to get this. Imagine a deck of identical cards, each in the shape of a 
parallelogram. The volume of the deck is its thickness times the area of the face of a card, and 
this volume is obviously unchanged if the deck is sheared into the shape of a parallelepiped 

18 Why? Well, if u = v, then, certainly u c = v-c for all c. On the other hand, lfu c = v c for 
all c we can, in particular, take c = u - v, which implies that u ■ c - v • c = (u v) c |u v) 
(u - v) - u - v| 2 '= 0. This in turn implies that u - v = 0, i.e., u = v. 
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Solution. 

The vector u x v is 1 to the plane of u and v. Therefore (u x v) x w, which is 1 to 
u x v, must lie in the plane of u and v. That is, 

(u x v) x w = Au + fiv, (*) 

where A and B are unknown scalar functions of u, v, and w. As the left side of (*) is 1 
to w, it follows that 

0= Au w + Bv w, 

which implies 

A = — C(v-w), B = C(u ■ w), 
where C is an unknown scalar function of u, v, and w. Hence 

(u x v) x w = C(u,v,w)[(uw)v - (vw)u], (**) 

In the special case u = w, (**) reduces to 

(u x v) x u = C(u,v,u)[|u| 2 v - (v u)u], (***) 

Take the dot product of both sides with v. Permuting the resulting scalar triple 
product on the left, we have 

(u x v) • (u x v) = C(u, v,u)[|u| 2 |v| 2 — (u-v) 2 ]. (****) 

But 

(u x v) (u x v) = |u x v| 2 

= |u| 2 |v| 2 sin 2 0, by (1.17) 

= |u| 2 |v| 2 (l - cos 2 0) 

= |u| 2 |v| 2 - (u v) 2 , by (1.8). 

Comparing the last line of this expression with the right side of (****) we have that 
C(u,v,u) =1. 

Now take the dot product of both sides of (**) with u. Permuting the resulting 
scalar triple product on the left, we have 

[u x (u x v)] • w = C(u,v,»)[(u-»)(vu) — (v-w)|u| 2 ]. 

But (***) with C(u,v,u) = 1 implies that 

[u x(u x v)]w = (vu)(u w)-|u| 2 v w 

Hence, C(u,v,w) = 1. □ 

Another useful interpretation of the scalar triple product is in terms of 
projected areas. See Exercise 1.15. 

How can we find the Cartesian components of u x v? This is not as easy as 
the analogous problem for u • v. For one reason, u x v is a vector rather than 
a scalar, and for another, u x v is a beast that lives in three dimensions only! 
(There is an algebra created by Grassmann that gives meaning to objects 
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such as u x v or u x v x w in higher dimensional Euclidean spaces where 
they are called wedge products.) The straightforward (but tedious) way is to 
note that repeated application of the distributive law (1.21) allows us to set 

u x v = (u x e x + u y e y + u 2 e 2 ) x (t> x e x + t^e, + r 2 e 2 ) 

= u x v x e x x e x + « x t!,,e x x e,, + w x t> 2 e x x e 2 

+ u y v x e y x e x + u y c,e y x e,, + u y v 2 e y x e 2 

+ u z v x e z x e x + u z v y e z x e y + u z v z e z x e 2 . (1.23) 

(Phew!) But e x x e x = 0, e x x e y = e 2 , e x x e 2 = -e,,, etc. (Draw a sketch!) 

Thus (1.23) reduces to 

u x v = e x (u y v z - u z v y ) + e y (u z v x - u x v z ) + e 2 {u x v y - u y v x ). (1.24) 
An easy way to remember (1.24) is to write it as 


u 


it z 

v z 


(1.25) 


where the formal determinant is to be expanded about its first row. 

From (1.24), (1.25), and the property of a determinant that interchanging 
rows and column leaves the value unchanged and interchanging two rows (or 
columns) changes the sign, we get the useful formulas 


(u 





t> x 


Vy 

v z 


(1.26) 


Problem 1.3. 

Determine a unit vector e mutually 1 to the vectors a ~ (1, -2,3) and b ~ (-1,0, 1) 
using and without using the cross product. 


Solution. 


In terms of the cross product, the vector we seek is 


± 


a x b 
la^bl ' 


From (1.25), 


-2 3 ~ (—2, —4, —2). 

0 1 


Hence |a x b| = 2^6, so that e ~ ±(1/^/6, 2/^6, 

To compute e without using the cross product, let e ~ (a,b,c). Then e a — a — 
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2b + 3c = 0 and eb = - a + c = 0, which implies that b = 2a and c = a. Hence e ~ 
«(1,2, 1). To make |e| = 1, set, a = ± X/Ji. 


Alternative Interpretation of the Dot and 
Cross Product. Tensors 

In physics we learn that a constant force F acting through a displacement D 
does F D units of work. Unlike displacement, force is never measured di- 
rectly, only its effects (extension of a spring, change in the resistance of a 
strain gauge, etc.). Also, force has different physical units from displacement. 
This suggests that forces — more precisely, the vectors that represent forces— 
lie in a different space than do displacements. If forces are measured in, say, 
pounds and displacements in feet, then the unit x-arrows 01 that we set up 
arbitrarily in each space represent, respectively, one pound of force and one 
foot of distance. Think now: how would you graphically compute F D? You 
might, first, bring the^positive x, y, and z-axes of each space into coincidence 
and next, since the OV s were chosen arbitrarily, stretch or shrink one until it 
coincided with the other. Finally, the length of the orthogonal projection of 
F onto D would be multiplied graphically by |D| to obtain a number with the 
units of foot pounds. 

All of this imaginary pushing and shoving was to give F Da geometric 
meaning so that one could do graphical computations (an ancient art little 
practiced nowadays.) Isn’t there a simpler way of looking at things that, while 
not affecting numerical computation, better mirrors different types of physi- 
cal objects? Yes there is: a force F may be thought of, mathematically, as a 
representation of a linear functional that sends any vector D (a displacement) 
into a real number (called the work of F through D). 

Analogous but more elaborate considerations hold for the cross product. 
For example, suppose that a force F acts at a point P with position x. Repre- 
senting the torque about O as x x F leads to the notion of linear operators 
that send vectors into vectors. Such operators are called 2nd order tensors. 
The name tensor comes from elasticity theory where in a loaded elastic body 
the stress tensor acting on a unit vector normal to a plane through a point 
delivers the tension (i.e., the force per unit area) acting across the plane at 
the point. See Exercise 1.20. Other important 2nd order tensors include the 
inertia tensor in rigid body dynamics, the strain tensor in elasticity and the 
momentum-flux tensor in fluid dynamics. 

The simplest, nontrivial example of a 2nd order tensor that I can think of 
is the following. Let the projection of a vector v on a vector u be denoted and 
defined by 

Proj u v = (v • 0)0. (1.27) 

The geometric meaning of Proj u v is shown in Fig. 1.8. The left side of (1.27) 
may be interpreted as the action of the operator Proj u on the vector v The 
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Proj u v 


Figure 1.8. 


right side of (1.27) tells us that Proj u sends v into a vector of magnitude v u 
in the direction of u. To qualify for the title tensor, Proj u must be linear. But 
if /S and y are arbitrary scalars and w is an arbitrary vector, then, by the 
properties of the dot product, 

Proj u (/?v + yw) = [(fly + yw)-u]u 
= (/?v 0 + ywu)u 
= /l(vu)u + y(wu)u 


= fi Proj u v + y Proj u w , ( 1 .28) 


i.e., Proj u is linear and therefore is a tensor. Now work Exercise 1.5 to make 
sure that you can apply this tensor in a concrete situation. 

The form of the right side of (1.27) suggests the following generalization. 
The direct product uv of two vectors u and v is a tensor that sends any vector 
w into a new vector according to the rule 


uv(w) = u(v • w). 

Thus, in particular, 

Proj u = uu. 


(1.29) 19 

(1.30) 


Tensors such as Proj u , that can be represented as direct products, are called 
dyads. As we shall see, any 2nd order tensor can be represented as a linear 
combination of dyads. 


Definitions 

To say that we are given a 2nd order tensor T means that we are told T’s 
action on (i.e., where it sends) any vector v 20 . Thus two 2nd order tensors S 
and T are said to be equal if their action on all vectors v is the same. More 

1 9 Many authors denote the direct product uv by u <8> v. 

20 The action of T on v will be denoted by Tv, T(v). or T v, as convenient. To a fastidious 
mathematician, the description of T is not complete without mention of its domain, the set of 
vectors on which T acts, and its range , the space into which T sends these vectors. We shall 
assume that the domain and range of a 2nd order teasor are obvious from the context, though 
often they are different; c.g., the domain of the moment of inertia tensor is the space of angular 
velocities, but its range is the space of rotational momenta (see Exercise 4.22). 
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formally, 


S = T Sv = Tv, Vv, (1.31) 

or, equivalently, 

S = T o u Sv = u Tv, Vu, v, (1.32) 

The zero and identity (unit) tensors are denoted and defined, respectively, by 
Ov = 0, Vv, and Iv = v, Vv. 

The transpose of a 2nd order tensor T is defined as that unique 2nd order 
tensor T T such that 


u Tv = v -T t u, Vu, v. (1.33) 

A 2nd order tensor T is said to be 


(a) symmetric if T = T T . 

(b) skew (or antisymmetric) if T = -T T . (1.34) 

(c) singular if there exists av ^ 0 such that Tv = 0. 

An arbitrary tensor T may always be decomposed into the sum of a symmet- 
ric and skew tensor as follows: 


T = |(T + T T ) + ^(T - T t ), (1.35) 

T + T' 1 = (T + T t ) t being symmetric and (T - T T ) = -(T - T t ) t being 
skew. 


Problem 1.4. 

If v ~ (v x ,v y ,v z ) and Tv ~ (-2c* + 3v z , — v z ,v x + 2v y ), determine the Cartesian com- 
ponents of TV 


Solution. 

Let TV ~ (a,b,c) and u ~ (a,fi,y). By definition, u ■ T T v = v-Tu for all vectors u and v. 
That is, 

oca + fib + yc = v x (-2x + 3y) + v y ( - y) + v 2 (a + 2p) 

= x(-2v x + v z ) + p(2v z ) + y(3Vjc - v y ). (*) 

As u is arbitrary, the coefficients of a, /?, and y on both sides of (*) must match. Hence 
T'v ~ ( - 2v x + d z ,2v z ,3v x - v,). 


The Cartesian Components of a Second Order Tensor 

The Cartesian components of a second order tensor T fall out almost auto- 
matically when we apply T to any vector v expressed in terms of its Cartesian 
components. Thus 
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Tv = T(i; x e x + v y e y + v z e 2 ) 

= i; x Te x . + vfTe y + t> 2 Te 2 , by the linearity of T. (1,36) 

But Te x , Te } „ and Te 2 are vectors and therefore may be expressed in terms of 
their Cartesian components, which we label as follows: 

Te x = T xx e x + T yx e y + T 2X e 2 (1.37) 

Te v = T xy e x + T yy e y + T zy e z (1.38) 

Te, = T x2 e x + T yz e y + T zz e z . (1.39) 

The 9 coefficients T xx , T xy , . . . , T., are called the Cartesian components of T. 

We indicate this by writing T ~ T, where T T is the matrix of coefficients 
appearing in (1.37) to (1.39). 

Here’s how to keep the subscripts straight: From (1.37) to ( 1 .39), T xx = 
e x • Te x , T xy = e x ■ Te,, etc. 

Problem 1.5. 

Determine the Cartesian components of the tensor T defined in Problem 1.4. 
Solution. 

Applying T successively to e x , e y , and e 2 , we have 

Te x ~ (-2,0,1)= -2(1,0, 0) +(0,0,1) 

Te, ~ (0,0,2) = + 2(0,0, 1) 

Te 2 ~ (3, — 1,0) = 3(1, 0,0)- (0,1,0) 

Hence, 



Problem 1.6. 

If u ~ (u x , u y , u 2 ), then u x may be regarded as a 2nd order tensor whose action on 

any vector v ~ (r x , v y , c 2 ) is defined by (1.24). Find the Cartesian components of u x , 

Solution. 

Applying u x , successively, to e x , e,, and e 2 , we have, according to (1.24), 

u x e x ~ (0, u 2 , - u y ) = u z (0, 1,0)- u,(0, 0, 1) 

u x e, ~ (— u 2 ,0,u x ) = — u 2 (l,0,0) + u x (0, 0, 1) 

u x e 2 ~ (u y , — u x , 0) = u,( 1,0,0) - u x (0, 1,0) 
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Hence 



The Cartesian Basis for Second Order Tensors 

Given the Cartesian components (v x ,v y ,v z ) of a vector v, we recover the 
vector via the representation v = v x e x + v y e y + v z e z . Given the Cartesian 
components of a 2nd order tensor T, what is the analogous representation for 
T?To save space we shall answer, first, in 2-dimensions. From (1.36) to (1.38), 


Tv = v x (T xx e x + T yx e y ) + v y (T xy e x + T yy e y ). (1.40) 

But v x e x = (v • e x )e x = e x e x (v), etc. Thus 

Tv = (T xx e x e x + T xy e x e y -I- T yx e y e x + T yy e y e y )v, Vv. (1.41) 

Because v is arbitrary, we infer fom (1.31) that 

T = T xx e x e x + T xy e x e y + T yx e y e x + T yy e y e y . (1.42) 

Is this representation unique? You bet. For suppose there is another, say 

T = T xx e x e x + T' y e x e y + T; x e y e x + T' y e y e y . (1.43) 

Then (1.42), take away (1.43), implies that 

UT XX - T’ x )e x e x + (T xy - T’ y )e x e y + ■ ] v = 0, Vv, ( 1 .44) 

which is equivalent to 

u l(T xx - T xx )e x e x + (T xy - T^ y )e x e y + • ] v = 0, Vu, v. (1 .45) 


But if u = v = e x , (1.45) reduces to T xx — T xx = 0; if u = e x , v = e>„ to T xy — 
T xy = 0, etc. Thus (1.42) and (1.43) must be identical. 

In three dimensions, the conclusion is that, uniquely, 

T = T xx e x e x + T xy e x e y + T xz e x e z 
-I- T yx e y e x + T yy e y e y + T yz e y e z 
+ T zx e z e x + T zy e z e y + T. z e z e z . (1.46) 

Thus the set |e x e x ,e x e )1 , . . . ,e z e z } of the 9 possible direct products of the 3 
Cartesian base vectors is a basis for the set of all 2nd order tensors. That is, 
any 2nd order tensor can be represented as a unique linear combination of 
these 9 dyads. 



20 


I. Introduction: Vectors and Tensors 


Exercises 

1.1. (a). Using graph paper, draw the five vectors 

P ~ (1, 1), <1 ~ (1,2), r ~ (2, 1), s ~ (0, — 3), t ~ ( — 1, 1) 
so that each has its tail at the origin. 

(b) . Using the head-to-tail rule for addition and graph papier, place the tail of q 

at the head of p, the tail of r at the head of q, etc., and indicate the sum of the 
five vectors as a vector whose tail coincides with that of p and whose head 
coincides with that of t. 

(c) . What are the Cartesian components of the vector u such that p + q + • - - + 

t + u = 0? 

1.2. Draw a careful perspective sketch illustrating that if u and v and then (u + v) 
and w are added according to the parallelogram rule, the last sum will be the 
diagonal of a parallelepiped having u, v, and w as co-terminal edges. 

1.3. Describe in words and with a sketch an experiment to demonstrate that forces 
add like vectors. 

1.4. If u ~ (2, —3,4) and v — (1,0, 1), compute u v and the enclosed angle. 

1.5. Using the vectors given in Exercise 1.4, compute Proj u v and Proj v u. 

1.6. Washington, D.C. lies roughly at latitude 39"N and longitude 77 W and Mos- 
cow roughly at latitude 56°N and longitude 38' E Taking the radius of the earth 
as 4000 miles, compute the great circle distance between these two cities. 

Hint: Imagine vectors from the center of the earth to the cities and use the dot 
product. 

1.7. Using (1.11), prove the distributive law (1.14) without introducing Cartesian 
coordinates. 

Hint: Use the fact that if a and b are co-terminal edges of a parallelogram, then 
2|a| 2 + 2|b| 2 = |a + b| 2 + |a - b| 2 . Apply (1.11) to u (v + w) and in the result- 
ing expression set v + w — u = (v — ^u) + (w — |u) = a + b. 

1.8. The Schwarz inequality 

|u v| < |u||v| (*) 

is an example of a simple geometric fact that has a useful analog for functions: 
if / and g are square integrable on an interval (a,b), then 

j| /ffdxj< JU 1 dx g 2 dx. (**) 

(a). Prove (*) in three ways by using 

(i) . (1.25) and the fact that 

|u x v| 2 ^0. 

(ii) . the fact that the second degree polynomial 

p(x) = |u + xv| 2 

is non-negative for all real values of x. 
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(iii). (1.11) and the fact from Euclidean geometry that the length of a side of 
a triangle is greater than or equal to the difference between the lengths 
of the other two sides. 

(b). Use the analogue of (i) or (ii) for functions to prove (**). 

1.9. Let a and b be given three-dimensional vectors and x unknown. Without in- 
troducing components, show that the unique solution of the linear algebraic 
equations 

x + a x x = b (*) 


_ b + (a • b)a + b x a 
1 + a a 

Hint: Set x = Aa + Bb + Ca x b and solve for A, B, C. To prove uniqueness, 
note that if y is another solution of (*), then 

x - y + a x (x - y) = 0. 

What can you conclude from this last fact? 

1.10. If a ~ ( — 2, 1,0) and b ~ (3,2, 1), 

(a) Compute a x b. 

(b) find the equation of the plane spanned (i.e,, determined) by a and b, 

1.11. Given a point P(x 0 ,y 0 ,z 0 ) and a plane 

n: Ax + By + Cz = D, (*) 

derive a formula for the (shortest) distance from P to n by using 

(a) , calculus to minimize (x - x 0 ) 2 + (y - y 0 ) 2 + (z — z 0 ) 2 subject to (*). 

(b) . vector methods to construct a 1 from P to n. 

Hint; Recall what N ~ {A, B, C ) represents. 

1.12. If u ~ (1, - 1,2), v ~ (3,2, 1), and w ~ (4, 1,7), compute 
(a), uv(w) (b). vu(w) (c), wu(u) 

1.13. Show that 

(a) , (uv) T = vu 

(b) . for any vectors a, b, c, (a x b) x c = (ba - ab)(c). 

1.14. Using either a calculus or geometric argument (a physical take-apart model is 
extremely useful here), show that the volume of a tetrahedron with co-terminal 
edges u,v,w is (i)|(u x v)-w|. 

1.15. Give a convincing (though not necessarily rigorous) argument to show that 
|(u x v) w| is the area of the parallelogram with co-terminal edges u and v 
projected onto a plane 1 to w. 

1.16. Show that in 3 dimensions. 


U = V «■ 11 X C = V X c, Vc. 


1.17. Let T be the tensor defined in Problem 1,4, 
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(a) . If v ~ (v x , v y , t-'J, fill in the blanks. 

Sv = X(T + T T )v ~ (-, -) 

Av = l(T-T>~( - 

(b) . Determine the matrices of the Cartesian components of S and A. 

(c) . Find the Cartesian components of the vector to such that A = to x . 

1.18. Let A be an arbitrary, 3-dimcnsionaI skew tensor. 

(a) . By expressing A in terms of its Cartesian components (and noting that only 

3 of these can be assigned arbitrarily since A = - A T ), find a vector to such 
that 

Av = to x v, Vv. 

(b) . Use the results of Exercise 1.16 to show that to is unique, to is called the axis 

of A and is important in rigid body dynamics. See Exercise 4.20. 

(c) . Show that v • Av = 0, Vv, in any number of dimensions. 

(d) . Show that A to = 0. Check this result by using the numerical values ob- 

tained in Exercise 1.17(c). 

1.19. Let p and v denote, respectively, the density and velocity of a fluid at a given 
point P in space at a given time t. If n is a plane with normal n passing through 
P, then the momentum flux across n at P and t is defined to be pv(v ■ n) = pvv(n). 
As indicated in Fig, 1.9, pv(vn)dAdr is the momentum at P and t carried across 
an oriented differential element of area ndA in time dt. pvv is called the momen- 
tum flux tensor at P and t. 

(a) . If v — (r x> determine the Cartesian components of pvv. 

(b) . If v — (3, — 1,2) and p = 4 at a given point and time, determine the momen- 

tum flux across the plane with normal n — ( — 1, 1,3), 

1.20. As in Fig. 1. 10, let ndA denote an oriented differential element of area at a point 
P and time t in a continuum (e.g. a fluid or solid) and let tdA denote the force 
that the material into which n points exerts across dA, t is called the stress at P 
and t in the direction if; t„ = Proj„t the normal stress, and t, = t — t„ the shear 
stress. By considering the equations of motion of a tetrahedron of the material 
of arbitrarily small volume, instantaneously centered at P, it can be shown that 
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Figure 1.10. 


t = Tn, where T = T T is the (Cauchy) stress tensor at P and t. 21 If 



and n ~ (1,2, - 1), compute the normal and shear stress. 

1.21. The directions of principal stress are defined as those unit vectors x such that 
Tx = Ax. Note that Ax is just the normal stress and that the shear stress is zero 
in a principal direction. The determination of all possible values of A and the 
associated directions x is an eigenvalue problem. You have seen such problems 
in linear algebra. Find the eigenvalues A and the associated eigenvectors x for the 
stress tensor T of the preceding exercise. 

1.22. (a). Determine the matrix of the Cartesian components of the direct products 

uv, uw, and vw, where u, v, and w are given in Exercise 1.12. 

(b) . Compute det(nv), det(uw), det(vw). 

(c) . Show that the determinant of the direct product of any two vectors is zero. 

1.23. The trace of the direct product is denoted and defined by 

tr(uv) = u v, tr(uv + wx) = tr(uv) + tr(wx). 

Use this definition and (1.46) to find an expression for trT in terms of its Car- 
tesian components. 

1.24. The derivative of a second order tensor T, that depends on a single parameter t, 
may be defined by the rule 

Ta s (Taf V constant a. (1.47) 

(a). By definition, T sends vectors into vectors. Show that T is also linear and 
hence a 2nd order tensor. 

21 The set of all unit vectors is not a linear vector space (Why)? and so is not a suitable domain 
for T However, the definition and domain of the stress tensor can be extended in an obvious 
way. as suggested by Noll: TO = 0, Tv = |v|Tv, Vv / 0. 
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(b) . If v — (t),, v y , t>J, where v may depend on t, and tv ~ ( — 2 t 2 v x + 3 t 3 v t , 

cos 7ttv zr v x + 2tv„ + V 'T + t 2 i>J, fill in the blanks: 

+v~(- , , )■ 

(c) . If T = xy, where x and y are functions of t, what is T? 

(d) . Given the Cartesian components of T, how do we obtain those oft? 


1.25. The composition or dot product of two second order tensors S and T is a 2nd 
order tensor denoted by S • T and defined by 

S Tv = S(Tv), Vv 


(a) . If S = wx and T = yz, what is S • T? 

(b) . If S and T are differentiable functions of t, show that 

(ST)=St+$T (1.48) 

1.26. With reference to Exercise 1.18a and using (1.22), show that 

(to x) 2 = m x (m x ) = A 2 = mm — ( m ■ m) 1 (1.49) 

and hence that 

M 2 = — itr A 2 . 


1.27. The inverse of a second order tensor. A function / is said to be one-to-one (1:1) 
fix i) = fix 2 )=>x, = x 2 . In this case / has a unique inverse f l such that 
r'(fix)) = x, for all x in the domain of /. Moreover fif~ l (y)) = y for all 
y in the range of /. If T is a 2nd order tensor, then Tv t = Tv 2 implies that 
T(v, - v 2 ) = 0, by linearity. IfT is non-singular, this, in turn, implies that v, = 
v 2 . That is, T has a unique inverse T~‘ such that 


T 1 Tv = v, Vv 
T-T‘*w = w, Vw. 


(1.50) 


(a) . Show that T 1 is linear (and hence a 2nd order tensor as the notation 

presumes.) 

(b) . Show that (T~‘) r = (T 7 ) 1 . 

(c) . Show that the tensor defined in Problem 1.4 is non-singular. 

(d) . If w ~ (iv,, Wj,, w 2 ), and T is the tensor in Problem 1.4, fill in the blanks: 

T-‘w~ (-,-,-). 

Hint: Let T ‘w ~ ( a,b,c ) and note that the action of T on T ‘w must equal 


1.28. A 2nd order tensor T is said to be positive definite if 
x Tx > 0 Vx # 0. 

(a) . Show that if T is positive definite, then all the eigenvalues of T must be 

positive. 

(b) . Show that if T is non-singular, then T r • T is postive definite. 



CHAPTER II 


General Bases and Tensor 
Notation 


While the laws of mechanics can be written in coordinate-free form, they can 
be solved, in most cases, only if expressed in component form. This requires 
that we introduce a basis. Though the standard Cartesian basis is often the 
simplest, the physics and geometry of a problem, and especially the so-called 
boundary conditions, may dictate another. For example, if we wished to 
study the temperature distribution in a body the shape of a parallelepiped, 
we would choose most likely a basis consisting of vectors lying along three 
co-terminal edges of the body. An aim of tensor analysis is to embrace arbi- 
trary coordinate systems and their associated bases, yet to produce formulas 
for computing invariants, such as the dot product, that are as simple as the 
Cartesian forms. 


General Bases 

Let [ g { , g 2 , g 3 } denote any fixed set of noncoplanar vectors. Then any vector 
v may be represented uniquely as 

v = f'gi + v 2 g 2 + tr*g 3 = £ v%. (2.1) 

Used thus, the set {gi , g 2 , g 3 } is called a basis, and its elements, base vectors. 
The base vectors need not be of unit length nor mutually 1. A basis is illus- 
trated in Fig. 2.1. 

To understand the representation (2. 1 ) geometrically, we first consider the 
2-dimensional case. In Fig. 2.2 we have drawn a typical vector v and a basis 
{g,,g 2 }, i.e., two fixed, non-zero, non-parallel vectors. Through the head of v 



26 


II. General Bases and Tensor Notation 



Figure 2.1. 



we draw a line parallel to g 2 . It will intersect the line along g, at some point 
P. The vector of OP is then some unique scalar t>‘ time g,. (Thus i; 1 might be 
negative though we have drawn the figure as if it were positive.) We construct 
v 2 in a similar way. 

In 3-dimcnsions we are given a vector v and a basis {gi,g 2 ,g 3 }, all vectors 
having a common tail O. A plane passed through the head of v parallel to the 
plane of g 2 and g 3 will intersect the line along g, in some point P. This 
determines a unique scalar v‘ such that n'g! is the vector from 0 to P. In like 
fashion we determine v 2 and v 3 . 

The representation (2.1) is effected analytically as in the following. 


Problem 2. 1 . 
Given 


V.» 3 ). 


find (r 1 


g. ~ (1, - U2), g 2 ~ (0, 1, l),g 3 ~ (- 1, -2, 1) 
v ~ (3, 3, 6), 
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Solution. 

Equating corresponding Cartesian components on both sides of (2.1), i.e. applying 
rules (1.5) (1.7), we get 

3 = t>‘ - tr* 

3 = - v l +v 2 - 2v 3 
6 = 2v' + v 2 + i> 3 . 

Solving these simultaneous linear algebraic equations, we have 
(»V,t> 3 ) = (2,3,-l). 


The Jacobian of a Basis Is Nonzero 

Unless you are good at perspective, it may be difficult to sec from a sketch 
whether three vectors form a basis. (In higher dimensions it is hopeless.) Is 
there a numerical test for a set of vectors to be a basis? Yes, providing we 
know their Cartesian components. As Problem 2.1 illustrates, the answer 
hinges on whether a set of n simultaneous linear algebraic equations in n 
unknowns has a unique solution. It does, as you well know, if and only if 
the determinant of coefficients does not vanish. Let us rephrase this fact as 
follows. If G = [g l5 g 2 , • ] denotes the n x n matrix whose columns are the 
Cartesian components ofg,, g 2 , ..., then {g 1 ,g 2 , . . .} is a basis if and only if 
det G # 0. Using almost standard terminology, we shall call G the Jacobian 
matrix of {g,,g 2 ,. . .} and J = det G the Jacobian of (g,,g 2 ,.. .}.* 


The Summation Convention 

The summation convention, invented by Einstein, gives tensor analysis much 
of its appeal. Observe in (2.1) that the dummy index of summation i is re- 
peated. Moreover, its range, 1 to 3, is already known from the context of the 
discussion. Therefore, without any loss of information, we may drop the 
summation symbol in (2.1) and write, simply, 

v = v%. (2.2) 

“Dummy” means that the symbol i in (2.1) or (2.2) may be replaced by any 
other symbol without affecting the value of the sum. Thus p'g, = v j gj = v k g k , 
etc Replacing one dummy index by another is one of the first tricks a would- 
be index slinger must learn. 

We attach, however, one useful proviso: the summation convention applies 


' the label “Jacobian” is usually applied when the components of g,, g 2 , are partial deriva- 
lives, a situation we shall meet in Chapter III. 
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only when one dummy index is “on the roof” and the other is "in the cellar''. 
Thus u't>, = d 1 !!, + v 2 v 2 + v 3 v 3 , but v‘v‘ = t>V or v 2 v 2 or v 3 v 3 . Generally, 
repeated roof or cellar indices occur in quantities that are not invariants or 
not the components of invariants. ( Cartesian tensor notation , discussed in 
Exercise 2.21, is the one exception to this proviso.) 


Computing the Dot Product in a General Basis 

Suppose we wish to compute the dot product of a vector u = u‘g t with a 
vector v = v'g,. In doing so we must replace one pair of dummy indices, say 
the second, by another. Otherwise, blind application of the summation con- 
vention yields, incorrectly, u v = u'v'gjg; = u l v l g t g, + u 2 v 2 g 2 g 2 + 
u 3 v 3 g 3 g v The correct expression is 

u v = uVgj g; 

= ti l v l gi g, + U l v 2 g, g 2 + u'^gi gj 

+ ti 2 v'g 2 g, + ■■■ + wVgj gj. (2.3) 

This extended expression for u v is a nine-term mess. We can clean it up by 
introducing a set of reciprocal base vectors. 


Reciprocal Base Vectors 

What these are and how they simplify things is seen most easily in 2-dimen- 
sions. Given a basis {g, , g 2 }, we represent, as before, the first factor’in the dot 
product u- v as u = u‘g, + « 2 g 2 . (In this two-dimensional example, formulas 
are short enough that we do not need the summation convention.) However, 
let us represent the second factor in the dot product in terms of some new 
(and as yet unknown) basis {g',g 2 }, writing v = t^g 1 + v 2 g 2 . Then 

u v = (u'g, + u 2 g 2 )fv i g 1 + v 2 g 2 ) 

= "‘f I gr g 1 + u l v 2 gi • g 2 + u 2 t>, g 2 ■ g 1 + u 2 v 2 g 2 ■ g 2 . 

The idea is not to choose g 1 and g 2 so that the above expression reduces to 
U V = u l v t + u 2 v 2 . 

Thus we require that g t g 1 = g 2 g 2 = 1 and g, g 2 = g 2 g‘ = 0. Given g, 
and g 2 , we may construct g 1 and g 2 geometrically as follows: 

As in Fig. 2.3a, draw a line 1 to g 2 . Since g* g 2 = 0, g 1 must lie along this 
line. Now adjust the direction and length of g 1 until g 1 ■ g, = 1, as indicated 
in Fig. 2.3b. Repeat to construct g 2 . 

With the geometry well-understood (especially after the completion of Ex- 
ercise 2.22), we now develop an algebraic method (that a computer can use) 
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\ 

(a) (b) 

Figure 2.3. 


of constructing reciprocal base vectors in n-dimensions. This will take us on 
a small detour before returning to the problem of simplifying the component 
form of (2.3). 

Let {gi,g 2 , - } = {g,} be a basis. Then, det G # 0. As you recall from 
matrix theory, this implies that G _I exists. The elements in the ith row of G _l 
may be regarded as the Cartesian components of a vector g', i.c.. 


G -1 



= [g , ,g 2 ,..-] r =[g i ]. 


where T denotes transpose. (Consistent with this notation we may set G = 
[gj] when we wish to regard G as a collection of column vectors.) The law of 
matrix multiplication says that the element in the ith row and 7 th column of 
G~'G is the sum of the products of corresponding elements in the ith row of 
G“‘ and the jth column of G. Thus, G 'G = / is equivalent to the statement 


g‘g 1 = *} 


if i = J 
if i * j' 


(2.4) 


The symbol Sj is called the Kronecker delta. It is ubiquitous in tensor anal- 
ysis. The set {g',g 2 , • = (g'} is called a reciprocal basis and its elements, 
reciprocal base vectors. The name basis is justified because det G -1 # 0. 


Problem 2.2. 

Find the reciprocal base vectors for the basis given in Problem 2.1. 


Solution, 

We must compute G 1 and then read off the Cartesian components of g 1 , g 2 , and g 3 . 
A systematic way of computing G" 1 is to reduce G to / by a sequence of elementary 
row operations (involving, possibly, row interchanges). This same sequence of opera- 
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tions applied to / will produce G 1 . We adjoin I to G and carry out these operations 
simultaneously, as follows. 

1 0 - 1 | 1 0 ol F 1 0 - 1 | 1 0 0~ 

-1 1 — 2 | 0 1 0 -> 0 1 -3| 110 

2 1 1 | 0 0 lj L° 1 3 1-201 



10-11 1 0 o' 


\ 0 0 1 \ -li 

- 

01-3|1 10 

- 

0 1 0 1 -i 1 1 


_0 0 6|-3-ll_ 


_0 0 1 I 


Thus g 1 ~ (1/6) (3, - 1, 1), g 2 ~ (l/2)( — 1, 1, 1), g 3 ~ (1/6) ( — 3, - 1, 1). 


The Roof (Contravariant) and Cellar (Covariant) 
Components of a Vector 

If {gj is a basis then not only may we express any vector v as v‘g h we may 
also represent v as a linear combination of the reciprocal base vectors, thus: 

v = v t g‘. (2.5) 

Breaking with tradition, we shall call the coefficients v‘ the roof components 
of v and the g f the cellar base vectors. Likewise, in (2.5), the shall be called 
the cellar components of v and the g‘ the roof base vectors. The conventional 
names for v 1 and v, are the “contravariant” and “covariant” components of 
v -names that seem to me awkward and meaningless. “Roof” and “cellar” 
also have mnemonic value in matrix theory where A) is sometimes used to 
denote the element of a matrix A that sits in the ith row and _/th column. The 
following diagram indicates how to remember which is the row and which is 
the column index: 

^C(olumn) 'War, 


Problem 2.3. 

Compute the cellar components of the vector v given in Problem 2.1. 


Solution. 

Writing (2.5) in extended form and noting (2.4), we have 

=(flg' + fjg 2 + fjg 3 ) gl 
= g' gl + «-'2g 2 gl + »3g 3 -gl 


But from the information given in Problem 2.1, v g, = (3)(1) + (3)( — 1) + 
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(6)(2) = 12. Likewise, 

t , 2 = vg 2 =(3)(0) + (3)(l) + (6)(l) = 9 
= v • g 3 = (3)( - 1) + (3)(— 2) + (6)(1) = - 3. 
The solution of Problem 2.3 illustrates a useful fact: 


Vt = vg,. (2.6) 

One would also hope that 

»‘ = »-g*. (2.7) 


To get some practice with the Kronecker delta, let us establish (2.7) starting 
from (2.2) and (2.4), but without writing out anything in extended form. (After 
all, tensor notation is designed to keep things compressed.) Taking the dot 
product of both sides of (2.2) with g J we have 

vg> = (v‘gt) g J 

= v< g J g, 

= v‘6/ 


□ 


( 2 . 8 ) 


Let’s take this derivation one line at a time. The second follows from the 
first because (au + f}\ + ■ ■ ■ ) • w = au • w + pv ■ w + ■ ■ ■ . The third line follows 
because (2.4) obviously holds if i and j on both sides are replaced by any 
other distinct symbols, say j and i. You’ve seen such renaming with functions 
where, for example, /(x,y) = x/y implies that /(y,x) = yjx. 

How is the last line in (2.8) obtained? Consider one of the possible values 
that the free index j may assume in the preceding line, say 2. Then, summing 
over the repeated index i, we have v • g 2 = v'8 2 + v 2 8 2 + t> 3 (5 2 . But 8 2 = 8 2 = 
0 while <5f = 1. Thus the sum collapses to simply v 2 . The step from the next 
to last to the last line of (2.8) shows that the Kronecker delta may be regarded 
as a replacement operator. That is, multiplying v' by <5/ replaces the index i on 
v by the index j. Finally, note that \ g J = v J and v‘ = v g' are equivalent 
statements, just as /(x) = x 2 and z 2 = f(z ) are. 


Simplification of the Component Form of the Dot 
Product in a General Basis 

Let us return to the problem of simplifying the extended component form of 
u v. Referring both vectors to the same basis led to an explosion of terms 
when we wrote (2.3) in extended form. Instead, let us set u = u‘g, but v = Vjg J . 
Then 

u‘Vjgig J = U'VjSf = U% = «*!>! + U 2 v 2 + M 3 t>3, 


uv = 


(2.9) 
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a formula no more complicated than that involving the Cartesian compo- 
nents of u and v. An alternative way to compute the dot product is to set 
u = Ujg' and v = v J gj. Then 

uv = «,!>'. (2.10) 


Problem 2.4. 

Given u = 2g, - g 2 + 4g 3 and w = - 3g‘ + 2g 2 - 2g J , compute u v, w v, w v, and 
u w, where v and the cellar basis vectors are given in Problem 2.1 and the roof basis 
vectors are given in the Solution to Problem 2.2. 


Solution. 

The roof components of u are given, while the cellar components of v are given in the 
solution in Problem 2.3. Thus, by (2.9), 

u-v = (2) (12) + (- 1)(9) + (4)( — 3) = 3. 

The cellar components of w are given while the roof components of v are given in the 
Solution to Problem 2.1. Thus, by (2.10), 

w • v = ( — 3)(2) + (2)(3) + ( — 2)( — 1) = 2. 

Finally, from (2.9) 

u w = (2)( - 3) + ( - 1)(2) + (4)( - 2) = - 16. 


Computing the Cross Product in a General Basis 

Sometimes it is convenient to denote the roof and cellar components of a 
vector v by (v) 1 and (v) j; respectively. With this notation, 

u x v = (u x v) t g* (2.11) 

To compute the cellar components of u x v we set u = u'g, and v = v J g } , so 
obtaining 

(u x v)* = (u x v)g* 

= u‘v J ( g, x gj) g k 

= u l v J € iJk . (2.12) 

The 3 x 3 x 3 = 27 symbols e ijk are called the cellar components of the 
permutation tensor P (See Exercise 2.13). Let us examine their properties. 
From (2.12) and (1.26), 

*123 = (8i x 82)83 = J, theJacobian. 

If we interchange two indices there is a change in sign, e.g., switching 2 and 3 
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above yields 

*132 =(g| x g 3 ) 82 =(82 x g,) g 3 = -(gj x g 2 )g 3 = -J, 
and switching 1 and 2 in this expression yields 


*231 = (82 x g 3 ) • g, = (g, x g 2 ) ■ g, = J. 


Moreover, if two or more indices are equal, e iJk = 0 because we can always 
permute the vector triple product (g f x g 2 ) g k so that the first term is the 
cross product of a vector with itself. 

In summary, 


' +J 

€ tjk = ' — ^ 

0 

Thus, returning to ( 2 . 1 1 ), 


if (i,j,k) is an even permutation of (1,2, 3 ) 
if (1,7, k) is an odd permutation of (1, 2, 3 ) 
if two or more indices are equal, 
we have 


( 2 . 13 ) 


u x v = e^uVg*. 


( 2 . 14 ) 


Problem 2 . 5 . 

Given u = 2 g t - g 2 + 4 g 3 , v = 2 g, + 3 g 2 - g 3 , where the basis {g ( } is given in Prob- 
lem 2.1, compute the cellar components of u x v. 


Solution. 

Let us write ( 2 . 12 ) in extended form for k = 1 , taking note of ( 2 . 14 ). 

/, 2 / l , 

(u x v), = €yi u‘v J = p\nU l V + Pi2l u V + ^f 3! W V' 

+ Zv + Zv + ^v 

= J(u 2 v 3 - uV). 

The roof components of u and v arc given, and from Problem 2 . 1 , 
10-1 
- 1 1 - 2 

2 1 1 


(u x v), = 6 [(-l)(-l)-( 4 )( 3 )]= -66. 
In like fashion, we find that for k = 2 and k = 3 , 
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(u X v) 2 = J(|»V - u'« 3 ) = 6[(4)(2) - (2)(— 1)] = 60 
(u x v) 3 = J(u l v 2 - uV) = 6 [(2) (3) - (- 1)(2)] = 48. 

To compute the roof components of u x v, we mimic the above procedure, 
but use roof instead of cellar base vectors. We obtain 

(u x yf = € ijk u l v J , (2.15) 

where u t and v } are the cellar components of u and v and the e‘ Jk are the 27 
roof components of the permutation tensor defined as follows: 

{ +J 1 if (i,j, k) is an even permutation of (1,2, 3) 

-J' 1 if (i,j,k) is an odd permutation of (1,2, 3) 

0 if two or more indices are equal. 

(2.16) 

In other words, € ijk = J 2 c‘ Jk . 

Here is an interesting identity connecting the components of the permuta- 
tion tensor and the Kronecker delta that you are asked to establish in Exer- 
cise 2.5: 

S' p 8‘ Si 

€ iJk € pqr = S J p SJ SJ . (2.17) 

s k s q k s r k 

If we expand the determinant and set r = k, we obtain 

€ iJk € pqk = 8$ ~ 8$, (2.18) 

which is intimately related to the vector triple product identity, (1.22). See 
Exercise 2.7. 

A Second Order Tensor Has Four Sets of Components 
in General 

This comes about as follows. Given a second order tensor T and a general 
basis {gj}, the action of T on each of the basis vectors is known, say 

Tgj = Tj. (2.19) 

Now each vector T^ may be expressed as a linear combination of the given 
basis vectors or their reciprocals. Choosing the latter, we may write 

Tj = Tjg 1 (2.20) 

The 9 coefficients T i} are called the cellar components of T. Explicitly, 

To = g> Tgj. 


( 2 . 21 ) 
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To represent T in terms of its components, we proceed just as we did in 
Chapter I. Thus if v is an arbitrary vector. 

Tv = T(tPg-), representing v in the basis {g,} 

= v j T gj , because T is linear 
= tr%g‘, by (2.19) and (2.20) 

= ^gV-v),by(2.7) 

= T i jg i gi(y), by (1.29). (2.22) 

As v is arbitrary, (2.22) implies that 

T = 1\j g l g\ (2.23) 

and we see that { g‘g J } is a basis for the set of all 2nd order tensors. 

Repeating the above line of reasoning, but with the roles of the cellar and 
roof base vectors reversed, we have 

Tg j = T J = T ij g t , T = T%g r (2.25) 

The 9 coefficients T iJ are called the roof components of T. In other words, the 
T iJ are the components of T in the basis {g.-gj}- The analogue of (2.21) is 

T iJ = g‘ Tg J . (2.26) 

The alert reader will now realize that there are two additional sets of 
components that can be defined, namely 

g' Tgy (2.27) 

and 

V = gyTg'. (2.28) 

These are called the mixed components of T. The dots are used as distin- 
guishing marks because, in general, T} } # Tj‘. It is easy to show that T has the 
following representations in terms of its mixed components: 

T = T.‘jgig J = Tj'g/gi, (2.29) 

i.e., the T.) are the components of T in the basis {g,g J } and the 7}'* are the 
components of T in the basis {g-'gj. 

Note that if T is symmetric, then T tj = g ; Tgy = g J T'g, = gj Tg, = T }i , 
Likewise T.) = 7} ‘ and T IJ = T Ji . However, T = T r does not imply that the 
matrices [7,’j ] and [7} ‘] are symmetric. See Exercise 2.23. 


Problem 2.6. 

Compute the cellar, roof, and mixed components of the tensor given in Problem 1.4, 
using the base vectors given in Problem 2.1 and the associated reciprocal base vectors 
given in the Solution to Problem 2.2. 
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Solution. 

We are given that 


Tv ~(-2 p i + 3v z ,—v z ,v x + 2v y ) 
g. ~ (1. - 1. 2), g 2 ~ (0, 1, 1), g, ~ ( - 1, - 2, 1). 


Tg, ~ (4, -2, - 1), Tg 2 ~ (3, - 1,2), Tg ~ (5, - 1, -5), 


Uy] = [g, 'Tgj] = 


From Problem 2,2, 

g 1 ~ (l/6)(3, - 1, 1), g 2 ~ (l/2)(— 1, 1, 1), g 3 ~ (1/6) ( — 3, - 1, l), 

and so 


g. Tg, g, Tg 2 

4 8 -4" 

g 2 Tg, 

-3 1 -6 

_ • • 

-1 1 — 8_ 


g‘Tg, g 1 Tg 2 


13 

12 

ll" 

g 2 Tg, 

= (1/6) 

-21 

.-H 

-6 

-6 

-33 

-19. 


IT.)] = [g‘ T gj ] = 


Finally, we have 

Tg 1 ~ (l/6)(- 3, - 1, 1), Tg 2 ~ (l/2)(5, - 1, 1), Tg 3 ~ (l/6)(9, - 1, - 5), 
yielding 

To o il 

[Tf] = [gj Tg‘] = | 4 0 -1 

3 -1 -2 

-7 51 23 

9 -45 -45 

11 —39 -31 


[T IJ ] = [g'Tg'j = (1/36) 


Change of Basis 

Within a given frame, vectors and tensors are blissfully unaware of the bases 
we choose to represent them. That is, they are geometric invariants. Under a 
change of basis it is their components that change, not they themselves. A 
major aim of tensor analysis is to provide recipes for computing new compo- 
nents from old ones once a change of basis has been specified. 

To concoct these recipes, let us start by assuming that each element of the 
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new basis is a known linear combination of the elements of old, say 

gi = ^igi + A \ 82 + A hi, I 2 = A \gi + •■ ’.83 = ••• (2.30) 

We may summarize (2.30) in cither matrix or index form as 

G = GA or g j = A%. (2.31) 

We shall need the inverse of (2.31) as well as the analogous relations between 
the new and old reciprocal bases. As det G = (det G)(det A) and {g, } and {g,} 
are bases, det A # 0. Therefore, A 1 = [(/T 1 )}] exists so, from (2.31), 

G = GA l or g j = (A~ l )%. (2.32) 2 

Furthermore, recalling that if a matrix B can be postmultiplied by a matrix 
C, then each row of BC will be a linear combination of the rows of C, we have 

G~‘ = A~ l G~ l or g^/T 1 )# (2.33) 

G -1 = AG~ l or g‘ = A)&. (2.34) 

The relations between the new and old components of any vector v follow 
immediately because 

Vj = gj-v = Afav, v‘ = g‘v = (A- l y jg J v (2.35) 

That is, 


iij = A}v h v‘ = (A 1 


(2.36) 


Likewise, for any 2nd order tensor T, 


T]j = g{ Tg ; = A k g k TA?g p , 
7/' = gj'Tg* = Ajg k T(A~ l )g p , 


f i j = g‘Tgj = (A- l )i^TA^ p 

f iJ = g‘ ■ Tg J = (A 1 )lg* • T(A ~ 1 y p gP 


That is, 


f iJ = A*AfT kp , f. i J = (A- l iA p j T k p 
fj 1 = A k j(A~ l )‘ p t; p , T ,J = (A ~ 1 )1(/1 ‘ 1 Y p T kp . 


(2.38) 


Problem 2.7. 


If 


1 2 1 

A = 2 10 

- 1 0 1 


and (v‘,v 2 ,v 3 ) = (2, 3, - 1), 


find (C 1 , ti 2 ,tr’). 


1 An advantage of index notation over matrix notation is that the order of multiplication is 
immaterial. Thus (A 1 )}g, = g,(/r% but, in general, G/T 1 # A *G. 



38 


II, General Bases and Tensor Notation 


Solution. 

Using the row reduction algorithm, we find that 

A = 1 -1 -1 

1 i 


Hence, from (2.36) 2 , 

S' - (-i)(2) + (1)(3) + (*)(- 1) = ? 
t?=(l)(2) + (-l)(3) + (-l)(-l) = 0 
w 3 = (-i)(2) + (l)(3) + (l)(-l) = |. 


Problem 2.8. 

Using the change of basis defined by the matrix A in Problem 2.7, compute f 2l and 
T 2 3 , where T is defined in Problem 2.6. 


Solution. 

From (2.38)! 4 we have 

f 2l = A\(A\T U + A]T l2 + AlT l3 ) 

+ A 2 2 (A\T 2l + A\T 22 + A 3 t T 23 ) 

+ A 3 2 (A\T 3l + AiT 32 + A{T 33 ) 

r 2 3 = + (A')lT ; 2 + (a-')\t; 3 ] 

+ AH(A') 3 T;' + ( A') 3 T i* + (A-'ftTi 3 ] 

+ + (A')lr; 2 + (A') 3 Ti 3 l 

The mixed and cellar components of T are given in the Solution to Problem 2,6 and 
the elements of the matrices A and A ' 1 in the statement and Solution of Problem 2.7. 
Thus all we need do is plug and chug: 

T 2l = (2) [(l)(4) + (2)(8) + (- l)(-4)] + (1) [(1 )( — 3) + (2)(1) + (- 1)(-6)] = 53. 
V = (2)[(-i)(0) + (0(4) + (f)(0)] + (l)[(-f)(0) + (0(0) + (f)(- 1)] = 


Exercises 

2.1 . Which of the following sets of vectors is a basis? 

(a) , g, ~ (4,6,2), g 2 ~ (1,0, l),g,~ (1,3,0) 

(b) . g, -(1,1,0), g 2 ~ (0, 2, 2), g 3 ~ (3, 0, 3) 

(c) . g, -(1,1,1), g 2 ~ (1,-1, l),g 3 -(-!,!,- 1) 
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2.2. l.et g, ~ (- 1,0,0), g 2 ~ (1, 1,0), g 3 ~ (1, 1, 1) and v ~ (1,2,3). Compute the re- 
ciprocal base vectors and the roof and cellar components of v. 

2.3. First simplify and then carry out explicitly any implied summations: 

(a) . 6-v.u 1 (b). 6jS{v k (c). SfS{ (d). e iu 6‘v k . 

2.4. The vectors g, , g 2 , g 3 , given in Exercise 2.2 form a basis. If the roof components 
of u and v are (2, 2, 1) and ( — 3, 1, 2) respectively, compute the three cellar com- 
ponents of u x v. 

2.5. Establish (2.17) by first setting (i,j,k) - (p,q,r) = (1,2,3) and then arguing 
about the value of both sides when ijk and pqr are various permutations and 
combinations of ( 1, 2, 3). 

2.6. (a). Establish (2.18) by expanding the determinant in (2.17) by, say, its first row 

and then setting r = k. 

(b) . Use (2.18) to show that e i]k e pJk = 2 S'. 

2.7. Establish the vector triple product identity (1.22) by first computing the roof 
components of u x v and then the cellar components of (u x v) x w. Finally, use 
the identity (2.18). 

2.8. Components of the identity tensor. Let 

ffij = grgj- ff iJ = g'g J 

Using the fact that 1v = v for all v, show that the various components of 1 arc 
hj = 9u, I! j = &j, V/ = 6}, 1 ii = g i ‘. 

Thus 

1 = g'gi = g.g' 

2.9. Raising and lowering of indices. Show that 

g‘ = d‘ k gk, gt = .9«g*> v ‘ = 0‘V f,- = 

T‘ =<?'%, T‘J = g ik T k \ T iJ = g ik T k , etc. 

As indicated, the effect of multiplying a component of a vector or tensor by g ik 
and summing on k is to raise or lower an index. Thus, for example, the index on 
v k is raised, becoming an i, by multiplying v k by g ik . 

2.10. Noting that [ 3 ;j ] = G T G, show that 

(a) . det[r/J = J 2 

(b) . g tk g kJ = <ij- 

2.11. Show that 

(a) , g, x gj = «,v*g* 

(b) g* = K J *g, x gj 

212. If u = u' g, = u,g', find formulas for the four different components of the 2nd 
order tensor u x . Use the following notation and procedure. 

("4j=gi (u X gj)=U (gj X ft) = -u = ••■ 
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2.13. A 3rd order tensor is a linear operator that sends vectors into 2nd order tensors. 
Any 3rd order tensor may be represented as a linear combination of triads. A 
triad is a direct product of three vectors (u, v, w), denoted by uvw and defined by 
its action on any vector x as follows: 

uvw ■ x = uv(w • x). 

Here is where the dot product, as a contraction operation, comes into its own. 
The double dot product of a triad acting on a dyad is a linear operator that 
produces a vector according to the rule 

uvw xy = u(wx)(vy). 

Finally, and obviously, we define the triple dot product of a triad acting on a 
triad as a linear operation that produces a scalar, according to the rule 

uvw ■ • xyz = (w • x)(v • y)(u • z). 

The 3rd order permutation tensor may be denoted and defined by 

P = €y»gVg‘. 

Show that 

(a) . P wvu = u(v x w) 

(b) . P VU = U X V 

(c) . P U = -ux. 

(d) . Let I x I be the 3rd order tensor defined by 

u (l x I) • v = u x v Vu, v. 

Show that P = — 1 x 1. 

2.14. Show, using (2. 17), that 

(a x b)(c x d) = da (be) + cb(da) — db(c a) — ca(d-b) 

+ [(a c)(b d) - (a d)(b c)]l 
= T S — (§T S)l, (2.39) 

where, recalling Exercise 1.13(b), 

S = ba - ab = (a x b) x , T = dc - cd = (c x d) x 
This result is useful in describing finite rotations. See Exercise 2.19. 

2. 15. Any change of basis g t = A) g, can be effected by a nonsingular 2nd order tensor 
B such that g ; = Bg r Show that 

(a) . B‘j = A). 

(b) . g = (B ~ 1 ) r g‘. 

2.16. An orthogonal tensor Q satisfies Q T Q = 1, i.e. Q 1 = Q ' 

(a) . Let Q = [e^Qe^], where {ej is the standard Cartesian basis. Show that 

Q T Q = /• 

(b) . Show that det Q = + 1. Hint: det AB = (det A)(det B ) (If det Q = + 1, Q is 

called a proper 1 tensor or rotator ) 

(c) . Show that under the 1 change of coordinates g, = Qg Jf the cellar and roof 

components of 1 (i.e., g tj and g (i ) are unchanged. 
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(d) . Explain why the columns or rows of Q may be regarded as the Cartesian 

components of mutually 1 unit vectors. 

(e) . If 

Q = Q‘jgtg J = Q'/g'gi, 

show that 

Q r = Q‘j g J gi = QUigK 

and hence that 

QkQ*j = <*/ • 

217, A reflector is an 1 tensor H that reflects vectors across a plane with normal n, as 
indicated in Fig. 2.4. 

(a) . Show, geometrically, that 

H = 1 - 2nn. (2.40) 

Hint: What vector when added to v gives Hv? 

(b) . Show that H 2 = 1. 

(c) . With the aid of Cartesian components, show that det H = — 1. (Hint: take 

e, = B). 

(d) . If n ~ (1, -2,3), and v ~ (v x ,v y ,v z ), fill in the blanks: 


(e). If u and v are distinct unit vectors, show that 

UV 4- vu — uu — vv 

H = l+ + — — (2.41) 

is a reflector that sends u into v and v into u. 

2.18. A 3-dimensional rotator R is characterized by an axis of rotation, with direction 
e, and an angle of rotation 0, reckoned positive by the right-hand rule. Fig. 2.5 
shows a vector u that has been rotated about e through 0 into the vector v = Ru. 



Figure 2.4. 
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(a) . By resolving v into components along the three mutually 1 vectors 

e x u, and e x (e x u), show that v = u + (sin0)e x u + (1 — cos0)e : 
(e x u) = (cos 0)u + (1 — cos 0)e(e- u) + (sin 0)e x u, and hence that 

R = (cos 0)1 + (1 - cos 0)ee + (sin 0)e x . (2.4: 

(b) . Show that Re = e. 

(c) . Using Cartesian components, show that det R = +1. 

Hint: take e x = e. 

(d) . Show that 

R T = (cos 0)1 + (1 - cos 0)ee - (sin 0)e x 

(e) . Show that in terms of the finite rotation vector 

r = 2 tan(0/2)e, (2.4: 

R = (1 + ir r) '[(l - ir r)l + §rr + rx], (2.4- 

(The advantage of introducing r is that (2.44) is a rational function of th 
components of r; the disadvantage hits when 0 = it.) 

(f) . If r — (1,1, 1), compute 0 and the matrix of Cartesian components of R. 

2.19. Given two non-parallel, three-dimensional unit vectors u and v, show, by takin 
sin 0 e = u x v and using (2.39), that the rotator that sends u into v about an ax 
1 to u and v is given by 

R = I + vu - uv + (1 + u-v)' ! [(uv + vu)(u v) -(uu + vv)]. (2.4. 

As (2.45) is free of the cross product, it is valid in any number of dimension 
(With (2.40) and (2.41), or (2.42) and (2.45), we can construct, in two differei 
ways, 1 tensors with determinant equal to — 1 or + 1, respectively.) 

2.20. Find the 2x2 matrix of the Cartesian components of the rotator that sends « 
into the unit vector e x cos0 + e y sin 0. 
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2.21. Cartesian tensor notation means writing all indices as subscripts and may be 
used whenever g' = gj. Show that this happens if and only if g t = Qe,, where Q 
is an 1 tensor. Invariant forms are often most easily established by introducing 
the standard Cartesian basis {ej and using Cartesian tensor notation. Thus, 
with the aid of (2.18), 


(u x v) • (u x v) = e ijk ujv k e iv u q v r = (S Jq S tr - SjA q )UjV k u q v r 

= u q v r u q v r — u r v q u q v r = (u • u)(v • v) - (u • v) 2 . 

2.22. Given a basis {g, ,g 2 ,g 3 }, explain the geometrical construction of the reciprocal 
basis {g\g 2 ,g 3 }. 


223. Given the 2-dimensional 2nd order tensor Tv ~ (« x — v y , t? x ), where v ~ (v x , v y ), 

(a) . Determine its symmetric and skew parts, S and A. 

(b) . If g, ~ (1,0) and g 2 ~(1, 1), compute the matrices [T*j], [7}' 1 ], [S! ; ] and 

[Sj 1 ]. Are these last two matrices symmetric? 

2.24. Let {g,} be a basis, T any 2nd order tensor, and set h, = Tg, 

(a) . Show that T = hjg*. 

(b) . Show that i(h,g' - g‘h,) is the skew part of T. 

(c) . If T is 3-dimensional, show that $g‘ x h f is the axis of its skew part. (See 

Exercise 1.18). 

(d) . If {g,} is the basis given in Problem 2.1 and H, ~(1,0, -1), h 2 ~ (2, 1,0), 

h 3 ~ (0, 1, 1), compute the Cartesian components of T and the axis of its 
skew part. Note that {g‘} is given in the solution to Problem 2.2. 


2.25. Show that the rotator R defined by (2.42) has the alternative exponential repre- 
sentation 


R 


S 1 + 0e X 


(flex ) 2 

2! 


(flex) 3 
3! + 


(2.46) 


as follows: 

(a). Noting that 

u x vw = (u x v)w and u x 1 = u x (2.47) 


in the sense that 


(u x vw) x = (u x v)w-x and (uxl) x = uxx, Vx, 
show that (1,49) implies 

(ex) 2 =ee-l, (ex) 3 =-ex, (exf= -(ex) 2 = -(ee-I),..., 
that is, 

(e x) 2 " = ( - l)" +l (ee — 1), (e x ) 2 ” +l = (- l)”e x , n=l,2,.... (*) 

(b). In (2,42) set (cosfl)l = 1 - (1 - cosfl)l and use (*) and the Taylor series 
expansions 


_ 0 2 _ A 4 

~ 2! 4! + 


to obtain (2.46). 


1 — cos 0 
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2.26. If e is fixed, R is a function of 0 only. Find a 1st order differential with initial 
condition for R(0). 

2.27. Just as any non-zero complex number z can be represented in the polar form 
z = re iB , r > 0, so any non-singular 2nd-order tensor T has the polar decomposi- 
tion 

T = V R = V <? #e \ 

where V is a positive definite and R is a rotator. Given T, use your knowledge 
of linear algebra to explain how you would first compute V and then R. 

2.28. Work Exercise 2.5 and then, using the same arguments, 

(a) . Show that if A = [/!}] is a 3 x 3 matrix, then (2.17) generalizes to 

4 4 A‘ 

e iJk e Mr det A = A J „ Aj, A{ . (2.48) 

A k A k A) 

(b) . What is the generalization of (2.18)? 

(c) . Show that by setting q = j and r = k in (2.48) and expanding the determi- 

nant on the right side, we obtain the Cayley-Hamilton Theorem (“a matrix 
satisfies its own characteristic equation”): 

A)AlA k p - (A k k )A‘jAl + (l/2)[(4) 2 - A{A k ^A\ - (det-4)^ = 0 (2.49) 
which, in index-free form, reads 

A 3 -(UA)A 2 + (l/2)[(tr v4) 2 -ltA 2 ]A - (del A)I = 0. (2.50) 



CHAPTER III 


Newton’s Law and Tensor 
Calculus 


Newton’s Law of Motion is studied in introductory courses in calculus, phys- 
ics, and dynamics. Being familiar, fundamental, and simple, Newton’s Law is 
an ideal vehicle for introducing many of the key ideas in tensor calculus. 

In its most primitive form, Newton’s Law states that, in an inertial frame, 
the motion of a mass-point p obeys 

f = mx, (3. 1) 1 

where f is the force acting on p, m is its mass, x is its position with respect 
to a fixed origin 0 in the inertial frame, and a dot- Newton’s notation — 
denotes differentiation with respect to time. 


Rigid Bodies 

Rigid bodies in classical mechanics are composed of mass-points that remain 
a fixed distance apart and exert on one another mutually parallel forces only. 
It may be inferred from (3.1) that such bodies obey the gross form of Newton's 

Law , 

F = L, (3.2) 

where F is the net external force acting on the body, 

L = MX (3.3) 

is its linear momentum, M is its mass, and X the position of its center of mass. 2 

1 According to Truesdcll, Newton never stated his law in this form; Euler first did' 

1 See, for example, Goldstein’s Classical Mechanics , 2nd ed. 
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It may also be inferred from (3.1) that a rigid body of classical mechanics 
obeys the equation of conservation of rotational momentum, 

X x F + T = R 0 , (3.4) 

where T is the net torque on the body and R 0 is the rotational momentum 
about 0. 

In continuum mechanics, (3.2) and (3.4) are taken as postulates, valid for any 
body, while (3.3) follows from the definition of center of mass; rigid bodies are 
defined by a special constitutive assumption, without any reference to “atom- 
ic” forces. See Exercises 4.19 and 4.22. 

Combining (3.2) and (3.3), we obtain 

F = MX, (3.5) 

This law, though a gross one, yields, by itself, useful results for the motion of 
bodies that are nearly rigid and either nearly spherical or else rotating at a 
known rate, and small in the sense that their diameters are much less than the 
scale of the field of interest. I have in mind raindrops falling on my head, golf 
balls soaring over fairways, a record throw of a shot (but not one of a javelin 
or discuss), a satellite tumbling in space, or an asteroid orbiting the earth. 

As (3.1) and (3.5) are of the same form, we shall work with (3.1), but leave 
the physical interpretation open. 


New Conservation Laws 

The simplicity of (3.1) — the coordinate-free form of Newton’s Law - suggests 
the following manipulations. 

(a) Take the dot product of both sides of (3.1) with x to obtain 

fx = fflS'X = (jmx'i)' = L (3.6) 

In words, the external power (f- x) is equal to the rate of change of kinetic- 
energy (k). 

(b) Take the cross product of both sides of (3.1) with x and note that x x x = 
o to obtain 

t 0 sxxf=xxM = (xx mx)' = r 0 , (3.7) 

i.e„ the torque t 0 about the origin is equal to the rate of change of the 
rotational momentum r 0 about the origin. 

(c) If there exists a potential t>(x) such that 

fx = -v, (3.8) 

then f is said to be conservative, and (3.6) implies that 


constant. 


(3.9) 
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i.e., energy is conserved. Often we can solve simple problems with the aid 
of (3.9) and thus avoid having to solve a differential equation. See Exer- 
cise 3.2. 

(d) See Problem 3.2. 


Nomenclature 

The set of all points C occupied by a mass-point p between two times, say a 
and b, is called the orbit of p. The point on C occupied by p at time t may be 
denoted by P(t). If C is represented in the parametric form 

C. x = *(t) = *(f)e, + + i(f)e 2 , a < t < b, (3.10) 3 

then the vector function & is called the trajectory of p, and x the radius vector 
to C. 

If x is dilferentiable, then 

x = P(t), a < t < b, (3.11) 

is called the velocity of p (at l) and |$(t)l its speed. £ is said to be smooth at t 
if x is continuous at t and smooth (everywhere) if x is continuous for all 
t e (a, b). A 

If £ is smooth, then $(t) is tangent to C at P(t). This follows immediately 
from the definition 

Ax 

*)(t ) = lim — , Ax = x(t + At) - ft(t) 

AI^O A t 

and from Fig. 3.1, which shows that the vector Ax/A t, being parallel to the 
vector Ax from P(t) to P(t + At), must approach the direction of the tangent 
line at P(t) as At -» 0. 

The length s along a smooth trajectory, measured from a point P(t , ), satis- 
fies the differential equation 

dsjdt = |*(t)|, s(t , ) = 0, (3.12) 

i.e., 

s = JW)|dT. (3.13) 

If x is differentiable, then 

3 Here and henceforth we adopt a useful notation: if / denotes a function, then its value at t is 
denoted by either /(/) or f the latter notation to be used whenever the functional form of / is not 

4 Note that a smooth trajectory need not have a smooth orbit. Consider a point p on the rim of 
a rolling wheel. Its trajectory is smooth but its orbit, a cycloid, is not: at points where p touches 
the ground, the velocity is zero but the orbit has a cusp. 
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AX /At 




x = v s a(() 

is called the acceleration of p (at t). 

Problem 3.1. 

If fi is smooth, show that the area swept out by x as t varies from a to b is 
= |xxi|<ft. 


Solution. 

From Fig. 3.2 it is seen that the area of the shaded triangle, being half the 
parallelogram having x and Ax as co-terminal edges, is given by 

A/4 = 2 |x x Ax|. 


(3.14) 

given by 

(3.15) 

area of the 
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As 3t|v| = |av|, A/l/Ar = ]|x x Ax/At|. Hence, in the limit as At -+ 0, 

dAjdt = i|x x x|, (3. 16) 5 

which implies (3.15). 

Problem 3.2. 

Show that in a central force field (f parallel to x) 

(i) x lies in a plane. 

(ii) Kepler’s Law holds: x sweeps out equal areas in equal times. 

Solution. 

(i) If f is parallel to x, then x x f = o and (3.7) implies that 

x x x = c, 

a constant vector. Thus 

x • c = x • (x x x) = x • (x x x) = 0, 

i.e., x lies in a plane. 

(ii) It follows from (3.15) and Problem 3.1 that dAjdt = ||c|- i.e 
||c|Al. This is Kepler’s Law. 


(3.17) 


(3.18) 

., A(t + At) - A(t) = 


Newton’s Law in Cartesian Components 


The four conservation laws derived under (a)-(d) are simple and useful and 
epitomize the virtues of expressing the laws of mechanics in coordinate-free 
form. However, except in special cases, these conservation laws alone cannot 
provide all of the details of the motion of a mass-point. To extract full infor- 
mation from Newton’s Law, we must express (3.1) in component form. In 
specific cases, we try to choose a coordinate system in which the components 
of f simplify. 

In the simplest case, the Cartesian components of f are known functions of 
position and time, so we set 


From (3.10) 


f = /*e*+/ y e y +/ z e z . 


x = xe x + ye y + ze z . 


whence 


(3.19) 

(3.20) 


f x = mx, f y = my, f z = mz. 


(3.21) 


5 For those who like a little more precision and detail: Assume that x = lim Ax/A t exists at Pit). 
Then the right side ol'(3.16) exists at P(t). By the triangle inequality, |^|x x x| - 3|x x Ax ,At|| < 
)|x x x - x x Ax/ At | - 1 1 x x (x - Ax/ A()| < 11*11* - Ax/At| -0. 
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Newton’s Law in Plane Polar Coordinates 


To study the motion of a planet about the sun or, more generally, the motion 
of a point-mass under any central force (say a ball-bearing, attached to a 
rubber band, whirling around on a horizontal, frictionless table), we would 
want to introduce plane polar coordinates ( r,0 ) defined by 

x = rcos0, j/ = rsin0. (3.22) 

With the change of variables (3.22), the position of a mass-point takes the 
form 


x = fL(r,0) = r cos0e x + rsin0e y , | x = i(u J ) = *V)e, (3.23) 6 

where r and 6 are unknown functions of time. By the chain rule, 

v = i = (dx/dr)r + (dx/d0)0, j v = (dx/du‘)u‘ (3.24) 

From (3.23), 


g r = dx/dr = cos 0e x + sin 0e y 

j g f = dxjdu' = (dx k /du i )e k . 

g„ = dxjdO = -rsinfle* -I- rcos0e r 
Since 


J( gr,g«) = 


cosd 

sin0 


-rsinO 

rcosO 


I 

j J(g!,...) = det[x‘ y ] 

I 


(3.25) 


(3.26) 


{gr.gfl} >s a basis at every point of the plane except the origin. (Such excep- 
tional points are the bane and beauty of many useful coordinate systems. 
Often the field equations of physics and mechanics admit solutions that are 
singular at exceptional points. Sometimes these solutions must be discarded 
on physical grounds. Yet other times they represent such useful idealizations 
as sources, sinks, vortex lines, concentrated forces, or black holes!) 

We shall call g, and g 0 the cellar base vectors of the (r, 0) coordinate system , 7 
and denote their coefficients in (3.24) by 

v r = r, v e = 0. ] = (3.27) 

6 A number of equations that follow have two parts, separated by a vertical dashed line. The 
right half represents the general tensor form and will be discussed presently, 

7 Here and henceforth we omit the qualifier “aside from exceptional points" whenever we speak 
of the cellar (or roof ) base vectors of a coordinate system. 
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These are the roof components of v. With this notation, (3.24) reads 

v = v r g r + v'u j v = v‘g,. (3.28) 

The geometric interpretation of g, and g e is simple. Through any fixed 
point P. with polar coordinates (r*, 0*), except the origin, there passes a uni- 
que r-coordinate curve with the parametric representation &(r,0*), and a uni- 
que O-coordinate curve with the parametric representation £(>•„,, 0). By the 
same argument we used to show that v = x is tangent to the curve traced out 
by x as t varies, g, = dxjdr and g„ = dx/dO are tangent, respectively, to the 
curves traced out by x as r or 0 varies. Fig. 3.3 shows the r- and 0-coordinate 
curves that pass through a typical point P and the associated base vectors g, 
and Also indicated is the decomposition (3.28). 


The Physical Components of a Vector 

The physical component of a vector w in the direction of a vector u is defined 
to be w u. Thus if v {r) and v te> denote, respectively, the physical components 
of v in the directions of the roof base vectors 

g r = cos0e x + sin0e > „ g 8 = r"'(-sin0e x + cos0e,,), (3.29) 

then from (3.27) and (3.28) 
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= v • g' = v r = r | 

j t/° = vg‘ = uVIg'l (3.30) 

»<«> = vg 9 = rt> 9 = r0. | 

These expressions agree with the ordinary definitions of radial and angular 
velocity. 

To compute the acceleration in plane polar coordinates, we differentiate 
(3.28) with respect to time, so obtaining 

a = v = v'g, + v'g, + 6% + v 9 gg. j a = v% + v%. (3.31) 


The Christoffel Symbols 

The compute & and g fl we must use the chain rule for, by (3.25), g r and g # are 
functions of r and 0 and these, in turn, are (unknown) functions of t. Thus 

g, = (dg r/dr)r + (dg r /dd)d J 

= v r g r ,r + v $ g r ,e I & = (dgi/8u J )u J 32) 8 

ge = (d&/<W + (dge/ddjO j = v J gij 
= v r g e , + v e go, e , | 

where a comma indicates partial differentiation with respect to the variable(s) 
whose index (indices) follow it. Being derivatives of vectors, g r ,, g r e = g^,- 
and g e e are themselves vectors. Therefore each can be written as a linear 
combination of gr and go, say 

gr.r = ITgr + T^g# 


(3.33) 

gr.e = g«.r = r re g r + r%g e j g t J = g u = r^g* 

g«,e = r r ee g r + r^g e . | 

The coefficients of g r and g« in (3.33) are called the Christoffel symbols of the 
(r, 0) coordinate system. Substituting, (3.33) into (3.32), we have 

g r = t’ r (r; r gr + r?rg„) + ^(I^gr + T%g e ) | 

I & = t J r t,g*. (3.34) 

ge = ^(Cegr + I^ege) + f^gr + T» e g e ) j 
Finally, we place these relations into our expression for the acceleration, 
(3.31 ), and collect coefficients of g r and g„, to obtain 

8 Note that g,, e = x >rt) = x, #r = J g,./ = \,j = *./, = gj,. 



The Christoffel Symbols 


53 


a = [t' r + (C r u r + n e p 8 )t; r + (r^ r +r^t;®)r®]g r 


+ [i) 6 + (r® r f r + r r W + (rt 9 D r + rg„t> V’jg,, 

= a r gr + «V 


a = (p* + nVr^g* 

= a k g k . 

(3.35) 

We call a r and a e the roof components of a in the (r,0) coordinate system. 

Things are not really as complicated as they appear because a number of 
the Christoffel symbols vanish. From (3.25) 


gr.r = 0 

8r,e = g e, r = -sin0e x + cos0e, = r~‘gfl 
g 0 0 = - r cos 0e x - r sin 0e y = —r g,. 


g ,j = x$je p (3.36) 

= x%u k „g k . 


Comparing (3.33) with the above, we have, by inspection, 

r; r = r? r = r; e = r2„ = o j 

I n, = x 'tjU k P , (3-37) 

r? e = r- 1 ,rs e = -r, ! 

so that the formula for a with v r = r and v e = 0 reduces to 

* = (r — r0 2 )g r + (0 + 2r~ i r0)g e a = (u k + u‘u J x p u u k „)g k 
= a r g r + a e g e . | = a*g». 


(3.38) 


Note that the physical components of the acceleration in the directions of 
g r and g 9 , are given, respectively, by 


a (r ' = a g r = a r = r - r0 2 
a m = a g 9 = ra 9 = rd + 2 rd. 


a U) = a .gi = («• + u p u«rj,,)/|g'| (3.39) 


formulas that you will recognize from calculus, physics, and dynamics. 

To get the component form of f = mu, we assume that the roof compo- 
nents of f are known. Then Newton’s Law may be written as 

f gr + /* go = m(a r g r + a e g e ). j f‘g t = ma'g, (3.40) 

But {gr, g 9 } is a basis, which implies that 


/' = ma r = m(r — rd 2 ) 
f e = ma e = m(d + 2 r'V0). 


/‘ = m(ii‘ + u p u 4 r{,„) 


(3.41) 


Now the motive for introducing polar coordinates in the first place was the 
expectation that, in a central force field, the component form of Newton’s 


54 


III. Newton’s Law and Tensor Calculus 


Law would simplify. And indeed it does. Note that in (3.41 ) 2 , 0 + 2 r~'fO = 
r 2 (r 2 0)'. Hence, if f e = 0, r 2 0 = r 0 , a constant — Kepler’s Law! Combined 
with (3.41),, this relation allows us to reduce the determination of the orbit 
of a mass-point to the solution of a first order differential equation, provided 
that f depends on r only. See Exercise 3.7. 


General Three-Dimensional Coordinates 

A convenient way to describe the position of a satellite orbiting the earth is 
to give its distance p from the center of the earth, its co-latitude <f> (i.e., the 
co-latitude of a point on the earth’s surface directly beneath the satellite), and 
its azimuth 6 from a non-rotating plane II through the polar axis. Such a 
system of spherical coordinates (p,4>,Q) is defined by 

x = psin^cos# 

y = psin^sin0 (3.42) 

z = p cos <t>, 

where (x,y,z) is a set of Cartesian coordinates with its origin at the center of 
the earth and the xy- and xz-planes coincident, respectively, with the plane of 
the equator and the plane IT. With the restrictions 0 ^ p, 0 <, <l> <, n, 0 <, 6 < 
2n, the transformation (3.42) becomes 1 : 1, except along the z-axis which is a 
locus of exceptional points. 

A study of the vibrations of a triclinic crystal in which the atoms of a cell 
lie at the vertices of a parallelepiped provides another example where a 
coordinate system other than Cartesian is preferable. Here we might intro- 
duce oblique Cartesian coordinates (u, v , w) defined by 

x = A\u + A' 2 v + A 3 W 

y = A\u + A\v + A\w (3.43) 

z = AiU + A\v + A 3 W, 

where — oc < u, v, w < 00. The constants A) would be chosen so that the 
coordinate curves are parallel to the skewed lines along which the atoms 
lie. As det[A!] # 0, the transformation is 1 : 1 and there are no exceptional 
points. 

Still other problems (for example the determination of the trajectory of a 
particle rising in a hot, tall chimney) call for circular cylindrical coordinates 
(r, 0, z), defined by 

x = rcos0, y = rsin0, z = z, (3.44) 

where 0 < r, 0 < 0 < 2n, — 00 < z < 00. This transformation is 1 : 1 except 
along the z-axis. 

Equations (3.42) to (3.44) are each examples of a coordinate transforma- 
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tion of the form 


x = x(u, v, w), y = j )(tt, v, w), z = 2(u, v , w). (3.45) 

Instead of using different letters to denote different coordinates, let us set 
x = x 1 , y = x 2 , z = x 3 and u = u', v = u 2 , w = u 3 . (The superscripts repre- 
sent indices, not powers.) Then (3.45) reads 

x 1 = X'(u',u 2 ,u 3 ) 

x 2 = £ 2 (u\u 2 ,u 3 ) (3.46) 


Each of these relations is of the same form. We emphasize this by rewriting 
(3.46) as 

x‘ = $V,« 2 ,« 3 ), 1=1,2, 3 (3.47) 

However, there is no need to continually remind ourselves that we are work- 
ing in 3-dimensions. Indeed, the beauty of tensor notation is that it reveals 
those relations that hold in any (finite) number of dimensions. Thus we shall 
drop explicit reference to the range of the free index and so replace (3.47) with 

x‘ = £'(«', m 2 ,m 3 ). (3.48) 

One final compression of notation. Let us agree that in the argument of a 
function, a sequence of variables such as u 2 , u 3 may be replaced by a single 
symbol, say u J . The only provision is that the symbol for the index (j in this 
case) be distinct from the symbols for any other indices in the term in which 
x‘ appears. This boils (3.48) down to its essence: 

x* = * V)- (3.49) 

The index j in (3.49) may be called an argument index. 


Newton’s Law in General Coordinates 

Newton’s law in general coordinates is obtained by mimicking the steps that 
led us from the expression for the position of a mass-point in polar coordi- 
nates, (3.23), to the formula for its acceleration in terms of roof components 
and cellar base vectors, (3.38). Our first thought is that things are bound to 
be more elaborate. However, because we refuse to tie ourselves to any partic- 
ular coordinate system, certain features of our equations that are common to 
all coordinate systems come into focus, each cluster of formulas exhibiting a 
simple pattern that is captured by tensor notation. 

Several of the equations that follow have a counterpart in polar coordi- 
nates, indicated by an equation number in brackets. Each of these latter 
equations has its generalized tensor form listed on its right. Your respect lor 
the economy of tensor notation will grow as we proceed. 

We begin by relabelling the Cartesian base vectors as follows: 
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e, = e x , e 2 = e„, e 3 = e r . (3.50) 

With the change from Cartesian to general coordinates defined by (3.49), the 
representation for the position of a mass-point takes the form 

x = £(iP) = + * 2 (iP)e 2 + £ 3 (tP)e 3 

= *V)e*. (3.51) [3.23] 

As x depends on the u J and these, in turn, are (unknown) functions of t, we 
have, by the chain rule, 

v = x = (dx/8u')i i 1 + (dxjdu 2 )u 2 + (8x/du 3 )u 3 

= (dx/dufu 1 . (3.52) [3.24] 

The three vectors 

g t = dx/du* = x ti = x*,e k (3.53) [3.25] 

are called the cellar base vectors of the u j -coordinate system. They form a 
basis at any point P where J( g, , g 2 , g 3 ) # 0 and are tangent to the associated 
coordinate curves that pass through P. We represent these base vectors by 
arrows with tails at P, although occasionally, as with the Cartesian base 
vectors e„ we shall place the tails at the origin. Fig. 3.4 is typical. 

Problem 3.3. 

Compute the cellar base vectors and the Jacobian for the system of spherical coordi- 
nates (3.42). 

Solution. 

When dealing with specific three-dimensional coordinate systems, it is frequently 
more tidy, typographically, to revert to using distinct letters for different coordinates. 
Thus, with 



Figure 3,4. 
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'e have, from (3.42) and (3.53), 
gi = g„ = 
g 2 = g* = 
gs = g« = 


•/(gp,g^g«) = 


= sin^cosfle* + sin^sin0e y + cos^e* 

(3.54) 

k j = pcos</>cos0e x + p cos ^ sin 0e y — , 

p sin <fie. 

(3.55) 

t „ = — p sin <i> sin 0e x + p sin <fi cos 0e y 


(3.56) 

sin ^ cos 0 sin ^ sin 0 cos<fi 

p cos <fi cos 0 pcosifismO — psin^ 

1 = p 2 sin (fi. 

(3.57) 


-psin^sin# psinficosO 


The coefficients 

v‘ = u 1 (3.58) [3.27] 

of the g, in (3.52) are called the roof (or contravariant) components of v. In the 
new notation defined by (3.53) and (3.58), the component representation for 
the velocity reads 

v = (3.59) [3.28] 

The physical components of v in the directions of the reciprocal base vec- 
tors g' are, by definition, 

v< ( > = v • g‘ = v • gVlg'l = a'/lgi. (3.60) 9 [3.30] 

To compute the acceleration, we differentiate (3.59) with respect to time, so 
obtaining 

a = v = p'g, + v%. (3.61) [3,31] 

The gj are functions of the general coordinates which, along the trajectory, 
are (unknown) functions of time. Thus, by the chain rule, 

g; = (dg i/du')u l + (dgi/8u 2 )u 2 + (dg { /du 3 )u 3 
= (dgi/du J )u J 

= v J g,j, (3.62) [3.32] 

where we have used, successively, the summation convention, the comma 
convention for partial differentiation, and (3.58). 

The vectors g t J can be expressed as linear combinations of the g k in the 
form 

s,j = rfg, + r* g2 + r£g 3 = r* jgk . (3.63) [3.33] 

Note that g , ; J = d 2 x/du‘du J = d 2 x!du i du i = g jA , i.e., g tJ is symmetric in the 
indices i and j. This implies that Ty = T*,. The Ty are called the Christoffel 
symbols of the u j -coordinate system. 

Substituting (3.63) into (3.62) and (3.62) into (3.61), we obtain for the accel- 


9 Recall that two dummy indicies on the same level (on the roof in this case) are not to be 
summed over. 
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eration, 

a = l,i g( + DVrJjg,. (3.64) 

We now throw a move that is typical in tensor analysis: we change the 
dummy index in the first term on the right from i to k. This allows us to write 

a = t> k g* + t>vr{f,g* 

= (t>* + = a k g k . (3.65) [3.35] 

The a k are the roof components of a (with respect to the basis {g fc } ). 

The physical components of a in the directions of the reciprocal base vec- 
tors g‘ are, by definition, and from (3.58) and (3.65), 

a u> _ a r = a'/ Igi = («‘ + tiWg/|gi. (3.66) [3.39] 

When f is represented in terms of the basis {g*}, Newton’s Law, f = m a, 
takes the form 

f k g* = ma k b (3.67) [3.40] 

which implies that 

/* = ma k = m(u k + u'lPrf;). (3-68) [3.41] 

Computation of the Christoffel Symbols 

Computation of the Christoffel symbols in any specified coordinate system 
is straightforward. Indeed, there exist computer programs that do this by 
manipulating symbols— not numbers - just as we are about to do. 

(a) Start from (3.51) and compute the Cartesian components of g fJ . 

(b) Compute the roof base vectors, i.e. find the inverse of the Jacobian 
matrix. 

(c) Take the dot product of both sides of (3.63) with g p , p being a free index. 
This yields g p • g uj = r' I / g p g* = r{j<5* p = T? . Or, renaming the free index, 

H, = g*’g,j- (369) 


Problem 3.4. 

Compute the Christoffel symbols for the u J -coordinate system defined by 


x^OP) 2 -^ 2 ) 2 . 


Solution. 

As mentioned earlier, when dealing with specific coordinate systems, it often simplifies 
the typography to set x = x 1 , y = x 2 , ..., x 1 , = x >u , g! = g u , etc. Thus, recalling our 
abe’s, we have 
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(a) 


x = *V> ( ~ (uv,w 2 ,u 2 - v 2 ). 

g u ~ (v, 0, 2 u), g„ ~ (u, 0, - 2d), g w ~ (0, 2w, 0) 

g„. u ~ (0,0,2), g u ,,~ (1,0,0), g u , w ~ (0,0,0) 

S„.u ~ (0, 0, — 2), (0,0,0) 

g„.„~ (0,2,0). 


(b) 


v u 0 
G = 0 0 2 w 

_2u —2 d 0. 


and hence J = 4w(u 2 


D 2 ). 


Rather than carry out the simultaneous row reduction of [G|/] to obtain [7|G '], 
it is simpler, since we are dealing with a 3 x 3 matrix, to use the formula (no sum on 
for j) 


G-‘ = J ‘[(-1)^], 


(3.70) 


where M tj is the determinant obtained from G by deleting its ith row and ,/th column. 
(See Hildebrand, Methods of Applied Mathematics , 2nd ed., pp. 16-17.) Thus 



4dw 

0 

2uw 


g" 

G l = r 1 

4uw 
. 0 

0 

2 u 2 + 2d 2 

— 2vw 

0 

~ 

g" 

.g w . 


(c) From (a) and (b) 

r;„ = g“ g u . u = J' 1 [(4dw)( 0) + (0)(0) + (2uw)(2)] 

= U/(U 2 + D 2 ) 


rj c = g“ 'g..,, = j 1 [(4dvv)(1) + (0) (0) + (2uw)(0)] 

= D !(U 2 + D 2 ). 

Continuing in this fashion, we find that 

- n. = = u/(u 2 + d 2 ), -ri = r v w = d/(u 2 + d 2 ) 


r:„ = i/w, 

while the remaining 1 1 Christoffel symbols are all zero. 


An Alternative Formula for Computing 
the Christoffel Symbols 

An alternative formula for computing the Christoffel symbols is 

n, = i 9 kp (9i P ,j + 9 ip ,i ~ au.pl (3.71) 

where, as explained in Exercise 2.8, g ,J = g‘ g J and g tJ = g. g^ are the roof 
and cellar components of the identity tensor 1. This formula has two advan- 
tages over (3.69). 
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First, it is applicable as it stands to general relativity where the indices 
range from 1 to 4 and where the g u — there called the covariant components 
of the metric tensor reflect the distribution of matter in the universe. 

Second, (3.71) is efficient in orthogonal coordinates, i.e., in coordinate sys- 
tems where the g, are mutually 1 (though not necessarily of unit length). In 
an orthogonal coordinate system, the matrix [g u ] is diagonal , as is In 
this event, (3.71) reduces to 

rf, = l 29 kk (g ik j + g jk .i - 9^) (3.72) 

This formula may now be broken into 4 mutually exclusive cases. 

(a) i, j, and k are distinct. In this case, 

ns, = 0, i # j # k # i, (3.73) 

because [# y ] is diagonal. In n-dimensions there are n(n - l)/2 Christoffel 
symbols of this type. 

(b) i and j are distinct but k is not. In this case, because T y = r‘ f and g u = 0, 

r\j — W'Qtij, 1 ^ j (3.74) 

In n-dimensions there are n(n — 1) Christoffel symbols of this type. 

(c) i and j are equal but distinct from k. In this case g ik = g jk = 0 and (3.71) 
reduces to 

k *i (3.75) 

In n-dimensions there are n(n - 1) Christoffel symbols of this type. 

(d) i, j, and k are all equal. Then 

n = WQus (3.76) 

In n-dimensions there are n Christoffel symbols of this type. 

Before deriving (3.71), let us illustrate the above results. 

Problem 3.5. 

Compute the Christoffel symbols in spherical coordinates. 

Solution. 

From (3,54) to (3.56), 


idol = 

gp-g„ gp g* gp'gfl 

g* 

= 

10 0 

Op 2 0 


. ■ gflg«_ 


0 0 p 2 sin 2 <fr_ 


Hence, by Exercise 2.10(b), 

fl 0 0 


[<? u ] = IgijY' = 


p 2 cse 2 ^J 
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Taking cases (a) (d) in order, we have 

(a) rl, = r$ s = o, rf 3 = r% = o, r ? 2 = = o 

(b) r [ 2 = rj, = V ‘011.2 = 2(i)(0) = o 

r! 3 = = \g ll g lus = 1(f)(0) = o 

rl, = rt = 20 22 022,i = Up' 2 )( 2 p) = p _1 

rf 3 = = W 2 922.2 = i(p 2 )(0) = o 

rli = = 2» 33 P.13.1 = i(p 2 csc 2 ^)(2psin 2 ^) = p 1 

n 2 = = 2 p 33 Pi3 2 = 2 (p” 2 csc 2 ^)(2p 2 sin^cos^) = col ij> 

(C) r > 2 = r% = - l 2 g"g 2 2.i = — a(l)(2p) = -p 

r33 = r ««= -2P n »33.i = — K 1 )(2p sin 2 ^) = — psin 2 ^ 

r?, = r*„ = -k 22 Pu.2= -i(p' 2 )(0) = 0 

Hi = r&, = -ip 22 p 33 ,2 = — i(P 2 )(2p 2 sin^cos^) = -sin^cos^ 

r?, = r2„= -if 33 *,,., - — 2 (p - 2 csc 2 ^)(0) = o 
rL = r% = -W 33 p22. 3 = -Hp~ 2 csc 2 M0) = o 
(d) rj, = r^ = 2 l ff ,, ff „. 1 = i(i)(0) = 0 
= r$, = |p 22 p 22 . 2 = i(p 2 )(0) = o 
r| s = rj. = k 33 P 33, 3 = |(p 2 csc 2 ^)(0) = o. 

While these results are fresh in our minds let us work another problem. 
Problem 3.6. 

Compute the roof and physical components a 2 and a' 2 ’ of the acceleration vector 
in spherical coordinates. 

Solution. 

From (3.65), 

a 2 = fj 2 + (u 1 ) 2 rf 1 + 2t>'t! 2 n 2 + 2y , « 3 r 2 3 + (t> 2 ) 2 rf 2 + 2wVrf 3 + (y 3 ) 2 r|j. 

From Problem 3.5 we see that the only non-zero Christoffel symbols in the above 
expression are Tf 2 { = T| , ) and Tfj. Thus, with a 2 = a *, etc. we have 

a* = V* + 2»vrk + (a*) 2 r^ 

= 4> J r 2p~ l p<j> — 0 2 sin^cos^. 

Again, from Problem 3.5, g 2 g 2 = p 22 = p~ 2 . Hence |g 2 | = p~' and we obtain 
£,<« = <j< 2 > _ a 2 /|g 2 | = pij, + 2p<f> - p0 2 sin <j> cos tj>. 

Note that the physical component a { *\ unlike the roof component a *, has the dimen- 
sions of acceleration, [length/(time) 2 ]. 

To derive (3.71) note first that 

9i j.k = (g i • g j), k = g i. k ■ g j + g, ■ g ;,*> (3-77) 

which, with the definition (3.63) of the Christoffel symbols, takes the form 

9ij,k = r&gp-gj + r&gp-g, = rf k g pJ + r%g pl . (3.78) 

Next interchange t and k and then j and k in (3.78) to get 
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9kj.i = Ui9 P j + r ji9 P k- 

(3.79) 

Sou = rfjg P k + r kj9 P , 

Now add (3,78) to (3.79), and subtract (3.79) 2 from the resulting sum. As 
FJ* = rf j5 etc,, there follows 

Gii.k + Skj.i ~ 9ikj = IFfkffpj- (3.80) 

Finally, multiply both sides by jg qj and sum on the repeated index j. As 
g 9i g pj = s q , the resulting expression reduces to 

17* = k qj (9ij.k + 9kj,i ~ g ik .j), (3.81) 

which is just (3.71) with the indices relabeled. 


A Change of Coordinates 

A change of coordinates from a tF-system to, say, a u '‘-system is defined by a 
transformation of the form 

u J = V(u k ). (3.82) 

In what follows we shall assume that (3 82) is 1 : 1 and continuously dif- 
ferentiable except, possibly, at certain exceptional points. For example, a 
change from circular cylindrical coordinates ( r,0,z ) to spherical coordinates 
(p,<j> y 0), as indicated in Fig. 3.5, takes the form 

r-p sin^ (3.83) 

0 = 0 (3.84) 

z = p cos <j>, (3.85) 

where 0 < p, 0 < # <, n, 0 < 0 <2n, p = 0 being the only exceptional point. 

Suppose that the various components of a vector v or a second order 
tensor T are known in the tF-system. How do we compute the corresponding 



Figure 3.5. 
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components in the u‘-system? As we saw in (2.36) and (2.38), this can be done 
once we have expressed the base vectors g j = d\/du J as a linear combination 
of the base vectors g, = dx/du' or vice versa. But if, initially, x = k(u J ), then 
by the chain rule 

gj = (8x/du l )du l /du J + (dx/du 2 )du 2 /du j + - -- 


= du'/du j g,. 

(3.86) [3.36] 10 

Thus (3.86) is of the same form as (2.31) with 


Aj = du‘/du J . 

(3.87) 

Of course, now, in general, the Aj vary from point to point. It is also apparent 
from symmetry just reverse the roles of u j and u k — that 

(^4 )j = du'jduK 

(3.88) 

It therefore follows immediately from (2.36) and (2.38) that 


Vj = (du‘/du J )v ( , v‘ = (du‘/du J )v J 

(3.89) 

7 ]j = (d U k /du i )(du p /du j )T kp 
f) = (du i /du k )(du p /du i )T k p 
f;‘ = (du k /du J )(du i /8u p )T k p 
f iJ = (du i /du k )(du J /du p )T kp . 

(3.90) 


These formulas are important enough to be boxed. Many texts on tensor 
analysis virtually start by taking (3.89) and (3.90) as the definitions of co- and 
contravariant vectors and co-, contravariant and mixed tensors of 2nd order. 
That is, any collection of objects (»,, v 2 , ■ ■ •)> or v i for short, is said to represent 
a covariant vector providing its components transform according to (3.89), 
with similar definitions for contravariant vectors v\ covariant tensors T tp etc. 
This viewpoint is taken sometimes in general relativity and in the theory 
of shells and membranes because in both cases one is dealing with curved 
continua. 

Problem 3.7. 

Suppose that in circular cylindrical coordinates, a 2nd order tensor T has the mixed 
components 

(" 2-1 1 
[7} f ]= 0 1 2 

3 0 — 2_ 

at the point r = 1, 0 = jc/4, z = —y/i. Find the component f 2 l of T in spherical 
coordinates. 

10 To obtain the 2nd line of (3.36) from (3.86), take i = k,j = p and u" = x p . 
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Solution. 

With (u l , u 2 , u 3 ) = (r,0,z) and (u 1 ,u 2 ,u 3 ) = (p,</>,0), we have, from (3.83) to (3.85), 



drjdp 

Br/3i/> 

dr/30 


sin <j> 

p cos ^ 0 


30/dp 

dOjdtj> 

30/30 

= 

0 

0 1 


_ 3z/3p 

dzm 

3z/30_ 


_cos^ 

— p sin </> 0_ 


Either by inverting this matrix or else by inverting (3.83) and (3.85) and then comput- 
ing dp/dr, etc. we obtain: 



3p/8r 3p/30 3p/8z 


sin ^ 0 cos ^ 

[rlfi'/OiC] = 

d</>/dr 3^/30 3</>/3z 

= 

p“‘cos^ 0 -p~‘sin^ 


JO/dr 30/30 B0/8z_ 


0 1 0 


From (3.90) 3 , 

f 2 ‘ = (Bu l /8u p )lT k p (3u k fdu 2 )] 

= (3u l /8u 1 ) [TV 1 (3u l /3u 2 ) + T 2 l (du 2 /du 2 ) + ■••] 

+ (Bu l j8u 2 )[T/ 2 (Bu l /du 2 ) + ■■■ ] 

+ (du'/du 3 ) [■ • • + r 3 ' 3 (du 3 /(?u 2 )] 

= (sin^)[7y'(pcos^) + T 3 '( — psin^)] 

+ (cos </)[T[ 3 (p cos^) + T'j 3 ( — p si n <^>) ] . 

At the given point, p = 2, sin ^ = J, and cos^ = - yfi/2. Hence, 

V = Q)[(2)(-v/3) + (1)(-1)] + (-y3/2)[(3)(-y3) + (— 2)(— 1)] = 4-2^3. 

Transformation of the Christoffel Symbols 

The transformation of the Christoffel symbols is a bit elaborate but straight- 


forward. By definition 

f* = g‘(dgj/du k ). 

(3.91) 

From (2.33) and (3.88), 

g‘ = (du , /du p )g p . 

(3.92) 

Furthermore, from (3.86), 



d li 



du k 

du k \du^ q ) 



d 2 u q du q dg q 

~da J da kgq + du J da k ' 

(3.93) 

But the base vectors g 4 are functions of the u r which, 

in turn, are functions of 
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the u k according to the given change of coordinates (3.82). Hence, by the 
chain rule and the definition of the Christoffel symbols, 

<5g, = <5g i dtS 
du k du r 8u k 


= ^P.g, 


(3.94) 


Substituting (3.94) into (3.93), and (3.93) and (3.92) into (3.91) and noting that 
g p 'g„ = 5 P and g p • g s = 8 P , we obtain 




du‘ / 8 2 u p 
du p \ du J u k 


8u q du r 
du j du k 


(3.95) 


Because of the underlined term, the new Christoffel symbols are more then 
just a linear combination of the old. This is why the Christoffel symbols 
cannot be regarded as the components of a 3rd other tensor (and therefore 
why some authors write { }*} instead of Fj ft ). Whenever Cartesian coordinates 
can be introduced 1 ', (3.95) provides a useful, alternative way of computing 
the Christoffel symbols— take the old coordinates u‘ as a set of Cartesian 
coordinates x 1 and the new coordinates u‘ as a set of general coordinates u‘. 
Since the Christoffel symbols vanish in any set of Cartesian coordinates, 
(3.95) reduces to 


p = -1 
jk dx p du j du k ' 


(3.96) [3.37] 


Exercises 

3.1. Cite three different sources that define inertial frame. Then give a definition in 
your own words. 

3.2. A ball-bearing of mass m is shot into the air vertically from a spring-loaded 
cannon whose muzzle is (lush with the ground. The spring is linear with spring- 
constant k and is retracted a distance D from the muzzle. If the spring imparts 
all of its stored energy to the ball-bearing, how high does the ball-bearing fly? 
Neglect air drag and take the force of gravity to be constant. 

3.3. If x = te x + t*e y -t- t 3 e z , -oc < t < oo, find: a) x; b) x. 

3.4. Let 

x = *(r), tx <t < p 

be the parametric equation of a curve C. If x is differentiable, the velocity v = x 
may, in view of (3. 1 2), be represented in the form 

1 1 Not possible on intrinsically curved continua such as balloons. 
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v = | v| t = it. (3.97) 

t is called the unit tangent to C. (At points along C where v = 0, t is undefined, 
but the representation (3.97) remains valid since, at such points, v = Ot = 0.) If s 
and t are differentiable functions of t, then 

x = v = a = st + st = a p + a c . (3.98) 

a p is called the path acceleration and a c the centripetal acceleration. Note that if 
we differentiate both sides of the identity t t = 1, we obtain t t + t - 1 = 0. Thus 
t t = 0, which implies that a p and a £ are orthogonal. 

(a) . Show that 

a p = (x • x/x • x)x, a, = x x (x x x)/x • x. (3.99) 

(b) . Compute a p and a c for the curve in Exercise 3.3. 

If we regard the unit tangent t as a function of arc length s, then we may always 
represent t' = dt/ds in the form 

t' = |t'|i' = hn. (3.100) 

k is called the curvature of C and n, which is the direction of t', the unit normal 
to C. 

(c) . Using the chain rule, show that 

a„ = ici 2 ii 

and, from (a), that 

k = |x x X|/|x| 3 . (3.101) 

(d) . Compute k for the curve in Exercise 3.3. 

(e) . The unit vector b = t x n is called the binormal to C. 

Show that b' ( = dh/ds) and n' may be expressed in the form 

b'=-tn. (3.102) 

n'=— Kt + ib. (3.103) 

The scalar t is called the torsion of C. 

(f) . Show that 

T =(x X X) ‘ x/| X X x| 2 . 

(g) . Compute t for the curve in Exercise 3.3 

The three linear, first order, vector-valued differential equations (3.100), (3.102), 
and (3.103) are called the Serret Frenet equations and are fundamental in the 
study of space curves. Given k and t as functions of arc length, the solution of 
the Serret Frenet equations determine a curve uniquely to within a rigid body 
movement. See Struik, Differential Geometry , 2nd ed.. Sect. 1 8. 

3.5. The parametric equation 

H: x = a(cos 0e x + sin 0e f ) + bfle 2 , 0 < 0 <, 2tt, 

represents a segment of a right-handed circular helix that lies on and wraps once 
around a right circular cylinder of radius a. 
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(a) . Compute the arc length s along H as a function of 0, starting at 0 = 0. 

(b) Compute the curvature and torsion of H. 

3.6. A rigid whirling arm with a tilting seat at its end may be used to reproduce 
accelerations that a person might feel on a ride in an amusement park or in an 
air or spacecraft. With reference to Fig. 3.6, the location of the center of the head 
of the person seated in the tilting chair is 

x = (a + !isin0)e r + hcos</>e z , 

where a, h, and e 2 are constant but 0 and e, may vary with time. Let {e u ,e„,e 0 } 
be a rigid frame attached to the chair, where 

e„ = e P cos ^ — e 2 sin <j>, e v = e r sin^ + e z cos^, e„ — e. x e r . 

The acceleration of the head may be expressed as 
a = a u e„ + a v t v + a„e„. 

Express a„, a„, and a 9 in terms of 0 , 4> and their time derivatives. 

3.7. Carry out the following steps to reduce the determination of the orbit of a 
mass-point in a central force field to two successive quadratures: 

(a) . User r 2 0 = r 0 to reduce (3.41), to a 2nd order differential equation in r. 

(b) . Multiply the equation so obtained by r, assume that f r — v'(r), and show 

that the resulting equation can be cast into the form ( )' = 0. 

(c) . Integrate, solve for r, and write the resulting equation in separable form, 

thereby reducing the solution for r to a quadrature. 

(d) . With r = 7(r) in hand, how would you find 0 = 6(t)l 

3 . 8 . Compute the roof base vectors {g*,g 2 ,g 3 } = {g'\ g' 3 , g 8 } in spherical coordi- 
nates (p, <f>, 0). (Hint: Since the cellar base vectors are mutually 1, g p = ig p , etc.) 
Check your answers by computing the matrix = [g‘ g'] and comparing 
your result with the same matrix computed in the Solution to Problem 3 5 
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3.9. For the (u, v, vv) coordinate system defined by 



compute: 

(a) . The cellar base vectors 

g„- g„, g*- 

(b) . The reciprocal base vectors at 

« = _l, t> = l, w = -1. 

(c) . The Christoffel symbols at any point. 

(d) . The roof components of the acceleration vector. 

(e) . The physical components a < “ ) , a {v) at u = — 1, » = 1, w = — L. 

3.10. Assume that we are in two dimensions. Given the transformation 



compute 

(a) . The base vectors g, = g„, g 2 = g„. 

(b) . The reciprocal base vectors. 

(c) . The 6 Christoffel symbols TJ , = T“„, T{ 2 = FJ„, etc. 

(d) . The roof components a “ = a 1 and a v = a 2 of the acceleration vector. 

(e) . The physical components a*" 1 and a 1 *’ 1 . 

(f) . If f = uv g B , write down the component form of Newton’s Law in the uv- 

coordinate system. 

3. 1 1 . By differentiating both sides of g‘ ■ gj = Sj, show that 

g^-rjfcg*. (3.104) 

3.12. Let C be a smooth curve having, in general coordinates, the parametric equa- 
tion C: u‘ = d'(f), a < t ^ b. By definition, the square of the differential clement 
of arc length is ds z = dx dx, where x = 3i(iJ‘(t)) is the position of a point on C. 
By the chain rule, 

dx = (dx/du^idu'/dQdt = gtu'dt. 


so 

dx dx = (gjU‘dt) (gjU J dt) 

= giju‘u J dt 2 . (3.105) 

The distanc e along C between two points P(r,) and P(t 2 ) is therefore equal to 
j!‘ s /()tjU , u J dt. Because of this relation, the g ijy especially in general relativity, 
are sometimes referred to, collectively, as the metric tensor. 

Let 6 = </> — t, 0 < r < rt denote the parametric equations of a curve lying on 
a sphere of radius R. Express the length of C as a definite integral. 
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3.13. Compute the roof components of the vector v ~ (x 2 + y 2 ,z, -2) in spherical 
coordinates, (3.42). 

3.14. Compute the 6 Christoffel symbols of the two-dimensional un-coordinate sys- 
tem defined by 

x = 2e uv , y = — e 3 "* 2 ”. 

3.15. Generalize Exercise 3.14: Determine formulas (using matrix notation if you 
prefer) for the Christoffel symbols of the u^-coordinate defined by 

x‘ = c‘exp(/t'u J ), 

where [/4j] is a constant matrix. (Recall: no sum on i on the right because i 
appears twice on the roof.) 

3.16. (a). Determine the transformation x‘ = X‘(u J ) such that r‘ Jk = a t 3‘, where the a k 

are constants. 

(b). Does there always exist a transformation of the form x‘ = .t'(u J ) such that 
the Tj* take on arbitrarily prescribed values? (Hint: First try special two- 
dimensional examples.) 

3.1 7. A class of 3-dimensional cylindrical coordinates is defined by a transformation of 
the form 12 


x + iy — f(u + it;) 


where / is an analytic function. Recalling that 

/'(« + iu) = x, + iy M = y „ - ix 

show that 

(a) - g„ - i&v = (e* - ie,)/'. 

(b) . the coordinate system is 1. 

(c) . J = l/'l 2 . 

(d) . g„,„ - ig„.„ = (g„ - !&,)/"//', g„.„ - ‘gv.v = (g„ + ig»)/"// 

(e) . rz„ = n= - k„ = mr/n r^= -r;„= - k v = Jinn 

where denotes “The real part of” and ./ denotes ‘The imaginary part of”. 
Compute the Christoffel symbols and sketch a few of the coordinate lines in the 
plane z = 0 for 

(f) . parabolic cylindrical coordinates, defined by 

/ = l(u + iv), -x < u < oo, 0 < v. 

(g) . elliptic cylindrical coordinates, defined by 

/ = cosh(u + id), 0 < u, 0 < v < 2 k. 

(h) . bipolar cylindrical coordinates, defined by 

/ = coth(u + iv), -oo <u < x, 0 < D < 71. 

3.18. Express Newton’s Law in component form in elliptic cylindrical coordinates 
12 z is the 3rd Cartesian coordinate and not the complex variable x + iy. 


CHAPTER IV 


The Gradient, the Del 
Operator, Covariant 
Differentiation, and the 
Divergence Theorem 


Suppose that you had a topographical map of a piece of land and wanted 
to indicate at a spot P on the map the slope m of the land in a direction t. 
This could be done by drawing a vector mt from P, as indicated in Fig. 4.1. 
Obviously, if the terrain is smooth but not level, there is one direction from 
P in which the slope is a maximum. This is called the direction of steepest 
ascent . 1 The associated vector is called the gradient of the elevation at P. If 
you draw a contour line through P, you will realize that the gradient at P 
must be 1 to this contour. 


The Gradient 

This 2-dimensional example suggests the following definition in n-dimen- 
sions. Let fix) be given in some region of £„. 2 The gradient of f at x is the 
vector having the direction and magnitude of the maximum increase of f with 
respect to distance away from x. Wc shall denote the gradient of / by V/ and 
represent it by an arrow with its tail at the head of x. When we wish to 
emphasize that V/is a function of position, we shall write V/(x). 

The gradient arises in many physical contexts. In the theory of heat trans- 
fer, Fourier’s law of heat conduction for a materially isotropic body states 
that the heat flux at a point is proportional to the negative of the temperature 
gradient at that point: q = - kVT, k > 0. In ideal fluid flow (no vorticity), the 

1 Ai a dome-like peak, all directions are directions of (zero) steepest ascent. Cone-like peaks are 
excluded by our assumption of smoothness. 

1 Such a function is sometimes called a scalar field. Likewise, vector and tensor functions may 
be referred to as vector and tensor fields , respectively. 
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velocity at any point in the fluid is equal to the gradient of a potential, 
v = \<j>. And in the torsion of prismatic bars, the shear stress vector acting at 
any point on a cross section is equal to the cross product of a unit vector k 
along the axis of the bar with the gradient of a stress function: t = kx 

The first aim of this chapter is to express V/ in a concise form, valid in any 
coordinate system. To this end, it is convenient to introduce the gradient 
indirectly, as follows. 

Let a set of general coordinates be defined by a transformation of the form 
x‘ = £'(«■'). The position x of a point P is then a function of the u J ' s and, 
therefore, so is /. Further, let C be a smooth curve having the parametric 
representation 

C:u J = H j (s), a s <, b, (4.1) 

where s is arc length. For example, in circular cylindrical coordinates, the 
helix H of Exercise 3.5 is described by 

H:r = a, 0 = s/c, z = bs/c, 0 < s < 27tc, 

where a and b are given constants and c = yja 2 + b 2 . 

Along C,/ is a function of s via the u j 's. Assuming that the partial deriva- 
tives of / with respect to the u j ’s exist, we may compute the rate of change of 
/ with respect to distance along C by the chain rule: 

df dfdu 1 df du 2 
ds du 1 ds du 2 ds 


dfdu^ 
du 1 ds 


The left side of (4 2) is a scalar invariant called the directional or path deriva- 
tive of /. It depends on /(x) and C, but not on the coordinate system used to 
specify x and C — if / is temperature and C represents a mountain trail, I 
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need only a thermometer and a pedometer to measure dfjds. On the other 
hand, the terms on the right side of (4.2) are evidently coordinate-bound. To 
interpret these terms geometrically, let £(s) denote the position of points on 
C and recall from the preceding chapter that t = £'(s) is a unit tangent to C. 
As x* = x‘(u j ) and, on C, u J = tP(s) we have, by the chain rule, 

t = (0x/du‘)(du‘/ds) = g ,f‘. (4.3) 

Thus, with our short-hand notation for partial derivatives, (4.2) takes the 
form 

df/ds * fjt 1 . (4.4) 

A glance at (2.10) shows that the right side of (4.4) is simply the dot product 
of t with the vector 


v/ = /,g\ 

(4-5) 

df/ds = V/t. 

(4.6) 


To justify our presumptuousness in denoting /.g' by V /, we note that if the 
right side of (4.4) is evaluated at a fixed point P*, then only the r n s change 
when one curve through P* is replaced by another. /, at P* is merely the 
partial derivative of/ with respect to u‘ evaluated at uj,, the ith coordinate of 
P + . That is, df/ds at P* is a function of t but not V/. Because df/ds = V/ t, 
df/ds is a maximum when t is aligned with V/ and, because |t| = 1, the value 
of this maximum is just |V/|. Thus V/ fits the geometric definition of the 
gradient given at the beginning of the chapter. 


Problem 4.1. 

If 

f = xy + yz + zx, 

compute V/ and |V/| at (12,5, —9). Compute the corresponding cellar components of 
V/ in circular cylindrical coordinates. 


Solution. 

Noting that 

/.x = y + z, f., = * + z , L = y + x, 
and evaluating these partial derivatives at (12, 5, —9), we find that 

V/ ~ ( - 4, 3, 1 7), | V/| = 7T-4) 2 + W + W) 2 = v '314. 
In circular cylindrical coordinates. 

/= r 2 sin 0 cost! + rz sin 0 + rz cos 0. 
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Hence, by (4,5), the cellar components of V/ are 

f r = 2rsinflcos0 + z sin 0 + z cos 0 

/„ = r 1 cos 2 0 - r 2 sin 2 0 + rz cos 0 - rz sin0 

/. = r sin 0 + r cos 0. 

The point P m with Cartesian coordinates (12,5, —9) has circular cylindrical coordi- 
nates (13, tan 1 5/12, —9), so that at P„, 

/. r = (2)(13)(5/13)(12/13) + ( — 9) (5/1 3) + (-9)(12/13) = -33/13 
f e = 56, f z = 17. 

Alternatively, we can apply (3.89), with t>, = /,. In this case (3.89), reduces to the 
chain rule: 

8f _ 8u‘ 8f 
8u j 8u j du 1 

With (u 1 ,u 2 ,t^ , ) = (r,0,z) and (u',u 2 ,u 3 ) = ( x,y,z ), it follows that at P* 
v‘ = /. r = fjdx/dr) + f.,(8y/8r) + fjdz/dr) 

= f x cos 0 + f,sinO 
= (-4)(12/13) + (3)(5/13)= -33/13, 
etc. 

Equation (4.6) reveals another important property of the gradient: If / is smooth in 
a neighborhood of a fixed point P*(x„), then V/(x„) is 1 to the surface /(x) = /(xj at 
P*. Why? Because / is a constant on all curves lying in the surface. Hence at P*, on 
any smooth curve C t passing through P*, 

(d//ds), = V/(x,)-t, =0, (4.7) 



Figure 4.2. 
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where t* is the unit tangent to C„ at P t . Fig. 4,2 illustrates (4.7) for curves Cj, 1 ’ and C* 21 
having distinct unit tangents t', 1 ’ and tj*. 


Probu-im 4.2. 

Find the Cartesian coordinates of that point P*(xJ on the ellipsoid. 

J = p + h + 3 2 = 1 

where the outward normal vector has Cartesian components (1, 1,1). 


Solution. 
From (4.5) 


V/ ~ (2x, y/2, 2z/9). 

As V/ points to the outside of the ellipsoid (why?), we seek a positive constant k and 
a vector x„ ~ (x*,y»,z*) such that V/(xJ ~ (2x„, yJ2, 2zJ9) = k(l, 1, 1). Thus x* = 
k/2, = 2k, z„ = 9k/2. Since x* must lie on the ellipsoid, (k 2 / 4) + k 2 + (9k 2 / 4) = 1. 
This equation has the positive solution k — 2/ v / T4. Hence x„ ~ (1,4, 9)/ x /l4. 


Linear and Nonlinear Eigenvalue Problems 

Linear and nonlinear eigenvalue problems arise in nearly every branch of 
mechanics and physics. A simple but typical problem— simple to state, that 
is — is to determine the possible shapes of an idealized, having chain when 
its upper end is spun at a constant rate about a vertical axis. Approximate 
solutions to such problems may be sought by attacking related finite-dimen- 
sional problems, obtained from the original problem by applying some dis- 
cretation procedure such as the finite element method. Often, the finite- 
dimensional problems can be phrased as follows: Find those points on the 
n-dimensional surface /(x) = 0 whose distance from the origin is stationary. 
If / is smooth, this form of the problem may be recast as an eigenvalue 
problem involving V/ and given a simple geometric interpetation. The idea 
may be illustrated in two-dimensions. 

A typical smooth curve /(x) = 0 is shown in Fig. 4.3. At any point P(x) 
where a circle centered at the origin is tangent to /(x) = 0, V/ must be paral- 
lel to x. That is, there is some scalar X such that 

V/(x) = AX. (4.8) 

Values of X satisfying (4.8) are called eigenvalues and the associated solutions 
for x, eigenvectors. If / is a quadratic function of the Cartesian components 
of x, then V/ becomes a linear operator or tensor, and (4.8) reduces to the 
familiar eigenvalue problem studied in linear algebra. 



76 


IV. The Gradient, the Del Operator, Covariant Differentiation 



Problem 4.3. 

Find the point(s) on the graph y = I/x 4 closest to the origin. 

Solution. 

Let / = x 4 y - 1. Then V/ ~ (4x 3 y, x 4 ) and (4.8) reduces to the two scalar equations 
4x 3 y = Xx and x 4 = Xy . 

(Note that these equations constitute a nonlinear eigenvalue problem.) As x ^ 0 
(why?), the first equation yields X = 4x 2 y. Thence, from the second, x 2 = 4y 2 or x = 
±2 y . The points we seek must lie on /(x) = 0. Hence 16y 5 - 1 = 0 or y = 16 ~ 15 , 
Thus there is only one eigenvalue, X = 16 2 ' 5 , but two associated eigenvectors, x ~ 
(± 16 1,,2 °, 16" 1 ' 5 ), each lying the same distance, 16 -l/5 > /5» from the origin. 


The Del Operator 

The Leibniz notation df/ds has two advantages. It allows us to manipulate 
the path derivative off as the quotient of two differentials and it allows us to 
think of the path derivative of / as the result of applying the linear differential 
operator d/ds to the function /. This second interpretation (but not the first!) 
carries over to V/: application of the del operator V to f produces V/. 

The component form of the del operator, 

V = g l 0/du l . 


follows immediately from (4.5). 


(4.9) 
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The Divergence, Curl, and Gradient of a Vector Field 

The divergence, curl, and gradient of a vector field $(u J ) arise by taking the 
three possible vector products of V with v. Thus we have 

(a) . The dot product 

V- v = g'-v,. (410) 

This is a scalar field called the divergence of v (sometimes denoted by 
div v). 

(b) . The cross product 

V x v = g' x v,.. (4.11) 

This is a vector field called the curl of \ (sometimes denoted by curl v). 

(c) . The direct product 

Vv = g i v j . (4.12) 

Because d\/ds = {dv/dufidu'/ds) = v,(g't) = (Vv) T t, we call (Vv) r (and 
not Vv) the gradient of v (sometimes denoted by grad v). 

By taking V as the second factor in the above vector products, we arrive at 
the scalar, vector, and tensor operators v • V, v x V, and vV. All of these are 
used in continuum mechanics. We shall show presently how v • V arises. 

Problem 4.4. 

If v = ze x + xye y + xyze 2 , compute V - v, V x v, and Vv. 


Solution. 

v x = ye y + yze z , v_ y = xe y + xze 2 , v. = e x + xye z , and, in Cartesian coordinates, 
g 1 = e x , g 2 = e y» g 3 = *i- Hence 

V • v = e x • v. x + e y • v y + e z -v x 
= 0 + x + xy = x(l + y). 

V x v = e x x v x + e y x v y + e z x v, z 
= (T*r - yze y ) + (xze x ) + (e„) 

= xze x + (1 - yz)e y + ye.. 

Vv = e,v x + e y Vj, + e 2 v 2 
= ye f e v + yze x e z 


+ xe y e f + xze y e z 
+ e.e x + xye.ej. 
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In heat transfer V q, evaluated at a point P, measures the rate of heat outflow from 
a neighborhood of P, where q is the heat flux vector. In fluid dynamics, V(pv), 
evaluated at a point P, measures the rate of decrease of mass in a neighborhood of P, 
where p is the mass density and v is the fluid velocity. In solid mechanics, Vu, 
evaluated at a point P, measures the change in volume of the particles in a neighbor- 
hood of P when P undergoes a displacement u. 

The curl and gradient of a vector field have other important physical interpreta- 
tions. In fluid dynamics, the vector to = V x v is called the vorricity. The change from 
point to point of to is a measure of the amount of friction or viscosity in the flow, while 
the symmetric tensor D = i[Vv + (Vv) 7 ], called the strain-rate, determines complete- 
ly the change in shape of an arbitrarily small neighborhood of particles centered, 
instantaneously, at a point P. In classical fluid dynamics, a knowledge of D at P 
allows the stresses in the fluid in a neighborhood of P to be calculated. 


The Invariance of V • v, V x v, and Vv 


The invariance of V- v, V x v, and Vv is implied by our coordinate-free nota- 
tion, but, of course, must be proved. This can be done either by introducing 
the component form of these expressions and then showing that their values 
are the same in every coordinate system or else by giving definitions for these 
expressions that are coordinate-free. For brevity we illustrate the two ap- 
proaches with V ■ v. 

Let u J = i3 j (m*) define a second set of coordinates. Computing V ■ v in the 
u'-coordinatc system via (4.10), we have, 

V • v = g‘ • {dv/du‘). (4. 13) 


By the chain rule 

Vv = g '((l\jdu J )(du J /8u l ) 

= g J (dv/du i ), by (2.34) and (3.87). (4. 14) 

This is just (4.10) with j instead of i. Thus, whether (4.10) or (4.14) is used 
to compute V - v, the values obtained will be the same. That is, V • v is an 
invariant. 

The second way of establishing this invariance is to show that at any point 
P 


V • v = lim J v n<L4/VolRas ||R||-0. (4.15) 

Here R is a region of diameter ||R|| enclosing P and having a piecewise 
smooth boundary dR with an outward unit normal n and differential element 
of surface area dA. As the integral in (4.15) can be defined without reference 
to a coordinate system, V v must be invariant. Equation (4.15) is a conse- 
quence of the divergence theorem that we shall consider later. 
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The Covariant Derivative 


The covariant derivative appears automatically when we express v A in terms 
of the roof components of v. Thus 

v. i = (v J gjh 

= v !iSj + v %,i 
= 

= (v k i + r^hg*. (4.16) 

With the abbreviation 

V^sjJ + r*/ (4.17) 

(4. 1 6) reads 

».i = V,P»fc. (4.18) 

The symbol V,t* is called the covariant derivative of v k (with respect to u‘). It 
has been defined so that the component form of v., in general coordinates looks 
the same as it does in Cartesian coordinates, save that the partial derivative 
operator d/du‘ is replaced by the covariant derivative operator V,. 

To compute the covariant derivative of other objects, we insist on the 
following two properties; 

(a) The covariant derivative of an invariant, e.g. a scalar, a vector, or a 2nd 
order tensor, coincides with its partial derivative. 

(b) The covariant derivative of a product follows the same rule as does the 
ordinary derivative. 

Thus, consider the dot product (j> = u v = u J Vj of two differentiable vec- 
tors. It follows, with the aid of (4.17), that 

<t>.i = {u'vjh 

= U'lVj + U J Vji 

= (V,u J - rl k u k )Vj + U j v jfl 

= Vj'S/ i u } + u k (v kJ -r{ k Vj). (4.19) 

But if conditions (a) and (b) are to be fulfilled, we must have, with a renaming 
of dummy indices, <j>j = VjV^ + whence 

V,^ s Vjj - r^. (4.20) 

In Exercise 4.5, you are led to the definition of the covariant derivative of the 
components of a tensor and in Exercise 4.16, you are asked to derive (4.20) in 
a different way. 

Again, by (a), v , = V t v J gj = V ; t' J g- i , which, together with (b), (4.17), and 
(4.20), implies that 
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v f gj = g } ,i - rfjg* = o, v ig J = g/ ( + r{ kg k = o, (4.21) 

i.e., the base vectors gj and g J are covariantly constant. 

From (b) and (4.21), 

= Vi(g-&) = g* Vigj + gj'V.g* = 0, etc, (4.22) 

i.e., the components of the identity tensor 1 are covariantly constant. 

Finally, we note that the covariant derivative is defined on Riemannian man- 
ifolds (where the g u determine everything). Why? Because V, = d/du ' + 
(terms involving the Christoffel symbols} and these latter terms, by (3.71), 
depend on the g tj only. However, though d 2 /du‘du j = d 2 /du J du‘ (when ap- 
plied to a sufficiently smooth function), V ; Vj / V ; V,, unless the Riemannian 
manifold is flat. For a proof you should consult a more extended text such as 
McConnell’s Tensor Analysis. We shall also touch on this point in the last 
section of this chapter on Differential Geometry. 


The Component Forms of V • v, V x v, and Vv 

The component forms of V • v, V x v, and Vv follow readily with the aid of the 
above results. In particular, whenever V is applied to an invariant, we may, 
by (a), take V = g'V;. Thus we have for the: 

(a) Dot product 

V • v = g‘ • Vi(v'gj) 

= g' g 

= s;vy 

= vy = + ny. (4.23) 

Now it is a remarkable fact (see Exercise 4.17) that 

r J = J-'Jj, (4.24) 

where J is the Jacobian. We therefore obtain the extremely useful formula 

V’v = J -1 (Jt>V (4.25) 


Problem 4.5. 

Compute V • v in spherical coordinates in terms of the roof and physical components 
of v. 


Solution. 

From (3.57), J = p 2 sm<j>. Hence, with v = !; p g f) + + Cg,,, wc have, from 

(4.25), 


The Kinematics of Continuum Mechanics 


V v= - r V-,U-(p 2 sin^") -t- ^(p 2 sin^i>*) -t- 4(P 2 sin^t’*) . 
;/sin^Lf?p d</> 50 J 

We recall that t/° = vg' = f'/lg*! = v'/^/g*', where the repeated index is not to be 
summed. Thus, with the aid of the solution to Problem 3.5, 

v (p) =■ v f , v**’ = pv*, v iB) = p sin^y 9 , 
so that, with all products differentiated explicitly, 

2 . 1 5v w cot tj> . . . L dv m 

V-v = — +-»* rt + --5x + v<*‘+— — 

op p p dtp p ps\n<p ou 

(b) Cross product 

V x v = g f x V,(Vjg J ) 

= g‘ x g J V,Vj 

= g*, (4.26) 

i.e., the roof components of the cross product are given by 

(V x V)* = € tJk V i v j . (4.27) 

(c) Direct product 

Vv = g‘Vj(Vjg j ) = V,vjg‘g\ (4.28) 

i.e., the covariant derivatives V/Vj are the cellar components of the tensor 
Vv. Likewise the V t v J are one of the two mixed sets of components of Vv 
because we may also write 

Vv = g'Vi(v J gj) = Vyggj. (4.29) 


The Kinematics of Continuum Mechanics 

The kinematics of continuum mechanics describes the motion of continuous 
distributions of matter, called bodies. Each particle X in a body is identified 
with a point P that moves through E 3 . At time t, the set of all such points 
determines a region S{t) called the shape of the body at t. Suppose that S is 
known at some particular time, say t = 0. Let y be the position of P at time t 
and x its position at t = 0. Then a motion of the body is a transformation of 
the form 


y = #(x, r), x = $(x,0), xeS(O), -oo < t < oo. (4.30) 

If distinct particles are not to coalesce in the course of their motion, then we 
must demand that the transformation be 1 : 1. This means that for each y in 
the range of the motion, there exists a unique x given by an inverse transfor- 
mation of the form 
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x = *(y,t), yeS(t), -ooctcoo. (4.31) 

Except at wavefronts (where fields such as temperature, velocity, or stress 
may jump), we assume that the functions $ and £ have a sufficient number of 
derivatives for the various field equations of continuum mechanics to make 
sense. (Some of these field equations appear in Exercises 4.6, 4.8, 4.13, 4.31, 
and 4.32.) 

The velocity of a particle with position x at t = 0 is denoted and defined by 
y = y = S,(x,t)^H\,t) (4.32) 

and its acceleration by 

a = v = $ t (x, 0 = si(x, r). (4.33) 

In classical elasticity, the behavior of a body depends on the deviation of 
its present shape from its initial shape, so all field variables are taken as 
functions of t and x, which are called referential or Lagrangian coordinates. 
Classical fluid mechanics, in contrast, assumes that the behavior depends 
only on the rate of change of the present shape. The initial shape being 
irrelevant, one usually regards all field variables as functions of t and y, which 
arc called spatial or Eulerian coordinates. In particular, from (4.3 1) and (4.32), 
the velocity field takes the functional form 

v == v(x(y, £), t) = v(y, £). (4.34) 

As a consequence of this relation, the formula for computing the acceleration 
must be modified, because v now depends on t both explicitly through its 
second argument and implicitly through its first argument y. 

Because motion occurs in E 3 , we may always introduce a Cartesian basis 
{e, }. Then, with x = x'e„ y = y f e, and v = v‘e t , we have, by the chain rule, 

dv dy‘ dv 
3 = dy‘ Jt + Ft 

= v ‘ y .i + v .r (4.35) 

Recall the invariant operator V = g‘V,. As mentioned earlier, we can, given a 
vector v, form a new invariant operator v V = u'V ( . Applied to any field F 
(scalar, vector, or tensor), this new operator must produce the same result 
regardless of the coordinate system in which we express it. In the Cartesian 
coordinate system (y l ,y 2 ,y 3 ), t> ( V,v = u'v ,. Hence (4.35) can be cast into the 
invariant form 

a = (v- V)v + v ,. (4.36) 

The first term on the right is called the convective acceleration and the 
second term the local acceleration. The convective term accounts for the fact 
that even in steady flow (v , = 0), there can be acceleration as, for example, 
when an incompressible fluid flows through a converging nozzle. The differ- 
ential operator 
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( )' = (v- V + d/dt) (4.37) 

that appears in (4.36) is sometimes called the material derivative ; F gives the 
rate of change of F as we ride through the flow on a particle. 

Problem 4.6. 

Compute the acceleration of a particle using cylindrical Eulerian coordinates. Express 
the answer in terms of the physical components of the velocity. 


Solution. 

With v = t>*g k and a = a k g*, the component form of (4,36) reads 
a* = v y t v k + t>*„ 

where V.-d* is given by (4.17). One can verify easily that the only non-zero Christoffel 
symbols in circular cylindrical coordinates (r,0,z) arc r 22 = - r and T? 2 = r ’.Thus 

V, v' = t>‘, = v'„ V, v 2 = v 2 + r? 2 t> 2 = v* + r~‘ v e 

v 2 tr* = v*2 + r' 22 v 2 = v'j - rv°, V 2 v 2 = v 2 2 + r 2 2l v' = rj + r~ l v r 

Vjt? 3 = a 3 = if,, V 2 u 3 = v% = t’fg 

Vjti 1 = v%, V 3 v 2 = i>® , V 3 i> 3 = v%. 

In extended form 

a' = a'V,!) 1 + + v 3 V 3 v' + v'„ etc. 

That is, 

a’ = v'v, + v e (v' e - rv") + v ! v r z + o' 

a e = y r (o 9 r + r~'v s ) + i>“(o,9 + '■' l y r ) + o'o 9 + of, 


The relation between the roof and physical components of a vector u in circular 
cylindrical coordinates, namely « <r) = u r , u m = ru 0> , u l2) = u z , is found in the same 
way as we found (3.30) in two-dimensional polar coordinates. Hence, 

a (r> _ + r' l o w, (oI? - o 19 ’) + o u, o l2) + i#>, etc. 


The Divergence Theorem 

The divergence theorem is the main tool for deriving the local (differential) 
equations of continuum mechanics from global (integral) statements of the 
fundamental laws. For example, if p and v denote, respectively, the density 
and velocity field of a fluid streaming through a fixed, closed region R of 
space, then conservation of mass requires that 
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'( J»- , = J pvndA. (4.38) 

Here t is time and 8 R is the boundary of R, assumed to be piecewise smooth 
with outward unit normal n. Because (4.38) must hold for all regions R , it 
may be shown that the Divergence Theorem implies 

p, + V-(pv) = 0 (4.39) 

in the neighborhood of any point P where the density and velocity fields are 
sufficiently smooth. 

Most large calculus texts establish the Divergence Theorem in Cartesian 
coordinates. 3 In 2-dimensions it states that: 

J ( p , x + Q.y) dx dy = J Pdy-Qdx, (4.40) 

where P and Q are differentiable functions of x and y and R is some simply- 
connected region in the xy-plane with a simple, piecewise smooth boundary 

5K:x = *(t), y — ${t), a <, t £ p. (4,41) 

The right side of (4.40) is short-hand for 

[P(*(0, iW'(t) - Q(*(t),j)(t))C(t)] dt. (4.42) 

Our aim is express the Divergence Theorem first in invariant form and 
then in general coordinate form. To this end, let us assume that OR can be 
parameterized by its arc length s (measured from some point on dR in some 
direction along dR). Being piecewise smooth, dR has a unit tangent t every- 
where, except at a finite number of points. Furthermore, being simple and 
closed, dR has an inside and an outside. Let s increase in such a way that 
nstxt. points toward the outside of dR. With v = Pe x + Qe y , the inte- 
grand in (4.42) reduces torn. We also recognize that P x + Q y = V ■ v, Fi- 
nally, to disguise the Cartesian coordinates completely, we denote the differ- 
ential element of area dxdy by dA, thereby reducing (4.40) to the invariant 
form 

VvdA = I v nds. (443) 

Jj? J dR 

Our treatment of dA has been, admittedly, cavalier. Whatever its precise 
definition, 4 dA must be an invariant for the simple reason that everything else 
in (4,43) is! 


1 E.g. Thomas and Finney. Calculus and Analytic Geometry, 8th ed. 

4 See Buck. Advanced Calculus , 3rd ed., or Spivak, Calculus on Manifolds. 
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To express (4.43) in coordinate form, we set v = u'g, and n = n,g'. Then 
V • v = VjV' and v n = uhr,. What about the elusive dA? I think that the eas- 
iest way to derive a coordinate-bound expression for dA is to return to basics. 

Recall that the left side of (4.43) is merely a symbol for the limit of a sum of 
the form 

S = I Tj) (*<> TjKRy c R. (4.44) 

Here <j> = V • v and 

R ij = {(*> y)|x, < x < x, + Ax, y } < y < + Ay} (4.45) 

is a Ax by Ay rectangle whose Southwest corner is the point (x„ y,), where 
x ( = iAx and y, = jAy, i,j — 0, ±1, +2, — 

In forming the sum S, a rectangular grid is chosen for convenience. Any 
other will do provided only that the diameter of its largest cell can be made 
arbitrarily small. Under the change of coordinates 

x = *(«,»), y = j)(u, u), (4.46) 

R becomes the image of some region R* in the ut-plane, and a rectangular 
grid in the UD-plane will map into some curvilinear grid in the xy-plane 
(Fig. 4.4). An element Rfi of the rectangular grid in the wu-plane is mapped 
into some curvilinear quadrilateral Q u in the xy-plane. As in Fig. 4.5, the 
area of Q,j is approximated by the area |g„ x g„|Au Ad of the parallelogram 
having two co-terminal edges at the point (*(«(), .P^)). But |g u x g,,| = 
|x. u y,„ - y, u x,J = |J'(“j,D j )|, the absolute value of the Jacobian of the trans- 
formation (4.46) at (u„Vj). Thus the sum S may be approximated by a new 
sum of the form 

S* = ^(Mj.DjJIJO^DjJIAuAD, (u„ Vj)eRTj <= R*, (4.47) 

whose limit, as Au, Av -*■ 0, we denote by <f>\ J\ dudv. Hence 

dA = \J(u,v)\dudv. (4.48) 

The coordinate form of (4.43) is therefore 




Figure 4.4. 
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Figure 4.5. 


| V,v l \J\du l du 2 = J v‘n,ds, 

ions in 3-dimensio 
divergence theorem 

i^ dv -L 


(4.49) 


Analogous considerations in 3-dimensions show that the invariant and 
coordinate forms of the divergence theorem are, respectively, 


(4.50) 


and 

j* V f u‘|J| du' du 2 du 2 = j* v‘n,\K\dv l dv 2 . (4.51) 

On the right side of (4.51), it is assumed that the surface dR is represented in 
the parametric form 

dR:x = ft(v a ), a a <> v a <, b a , a =1,2. (4.52) 

with \K\ = \dx/dv l x dx/dv 2 \. 

The arguments that lead from the global form of conservation of mass, 
(4.38), to its local form, (4.39), may now be spelled out. If p\ is differentiable 
throughout R then, by the divergence theorem, the right side of (4.38) may be 
replaced by the integral over R of V- (pv), giving 

| [P., + V(pv)]dF = 0. (4.53) 

Now mass is conserved in all regions R. Hence, if we assume further that the 
integrand in (4.53) is continuous, it follows, because R is arbitrary, that the 
integrand must vanish, i.e., (4.39) must obtain. Why? Because if there were a 
point P* in the flow where (4.39) failed to hold, then, because p , + V-(pv) is 
assumed to be continuous, it would be of one sign in some small ball centered 
at P*. Taking R to coincide with this ball, we would get a contradiction. 
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Differential Geometry 

Differential Geometry is the study of how curved surfaces differ from planes; 
it is a beautiful subject with manifold physical applications. 

Intrinsic Differential Geometry, the guts of general relativity, deals with 
those properties of a surface that can be inferred from measurements made 
within the surface — its so-called metrical properties. For example, a cone is 
intrinsically identical to a plane because (after being cut along a generator) it 
can be rolled into the latter without being stretched. Thus an observant ant 5 , 
marching along a chemically marked triangle on a sheet of paper, would 
note, upon returning to its starting point, that it had turned through 2n 
radians, but would have no idea whether it was living on a cone, a cylinder, 
or a plane. 

Extrinsic Differential Geometry, indispensable in continuum mechanics in 
the theory of shells, regards a surface as embedded in Euclidean space and 
measures its properties relative to some reference frame in this space. We 
shall adopt this latter viewpoint, but will point out strictly intrinsic results 
(which are useful in the study of general relativity). 

The simplest, most symmetric surface in 3-dimensional Euclidean space is 
the sphere: finite but unbounded, identical in every part. This perfection is 
reflected in its Cartesian equation: 

S 2 . x 2 + y 2 + z 2 = r 2 . (4.54) 

The form of (4.54) not only implies that the equation of a sphere with respect 
to a frame Ox , x 2 . . . x„ in n-dimensional Euclidean space is 

S B -i : x\ + x\ + ■ ■ ■ + x 2 = r 2 , (4.55) 

but (4.55), in turn, suggests the more general class of quadric surfaces 

Q n -,:A,jX t Xj= 1, (4.56) 

where the Affs are constants and a repeated index is to be summed from 1 
to n. 


Gaussian Surface Coordinates 

In the first section of this chapter, we learned that, in 3 dimensions, the 
gradient of a function /(x), evaluated at a point on the surface /(x) = 
constant, was a vector normal to that surface. Thus, for a quadric surface Q 2 , 
we set /(x) = x • A x = 1, where A = A tJ t t t jt and thereby obtain 

V/(x) = 2Ax providing /(x) = 1. (4.57) 

It is this last constraint, to which we must always recur because the compo- 
5 No doubt a relative of the ant in Gravitation , by Misner, Thorne and Wheeler 
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nents of x are not independent, that makes it so difficult to extract informa- 
tion about surfaces specified in the form /(x) = constant. Therefore, to sim- 
plify life, we henceforth assume that any 3-dimensional surface S we wish to 
study is given in the parametric form 

S'; x = i(u,t>), y = j)(u,t>), z = z(u,v), ( u,v)eD . (4.58) 

Here, the pair of real numbers (u,t>) are called Gaussian surface coordinates 
and belong to some domain D in the uo-plane. For example, that portion E 
of the ellipsoid in Problem 4,2 lying in the first octant may be specified by 

E: x = sin u cos v, y = 2 sin u sin v, z — 3 cos u, 0 < u, v <, n/2. (4.59) 

In keeping with standard tensor notation, we shall, just as we did in going 
from the extended notation of (3.45) to the compact notation of (3.49), replace 
the three equations in (4.58) by the single expression 

S: x* = *'(tA w* e D , (4.60) 

where it is understood ( until further notice) that, while Latin indices range 
from 1 to 3, Greek indices range from 1 to 2. Often, we shall work with the 
alternative vector parametric form 

S: x = £(«*) = i l (u a )ei, u a e D. (4 61) 

We shall assume that the function £(u a ) is smooth enough for all subsequent 
differentiations to be valid. 


The Cellar Base Vectors 

If we fix u 2 in (4.61) and let u l vary, then, just as we saw in Chapter III, the 
cellar base vector g, = 8x/du l = x, is tangent to the u 1 -coordinate curve, 
x = &(u‘, constant). Likewise, the cellar base vector g 2 ~ dx/du 2 = x i2 is tan- 
gent to the u 2 -coordinate curve. The mongrel notation g„, which a purist 
might insist on re-writing as g‘ a , has the advantage that the boldface re- 
minds us that g,, is invariant under a change of 3-dimensional coordinates 
within the Euclidean frame Ox,x 2 x 3 while the subscript reminds us that g* is 
coordinate-dependent on S, transforming in a cellar-like (i.e., covariant) way. 
Let us verify this last assertion. Under a (differentiable, non-singular) change 
of variables 


u a 

= tf'OA 

(4.62) 

we have, by the chain rule. 

* 


, dx 

dx du a du “ 

(4.63) 

^ = 


which, by definition, is just the law of transformation on ; 

i surface for an 


object with one cellar index. 
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Non-Degenerate Surfaces 

If w 1 and u 2 are functionally related in (4.61), for example, if they enter only 
in the combination u 1 + (u 2 ) 2 , then (4.61) represents a curve. To avoid this 
possibility, we shall require [except perhaps at isolated points such as x = 
y = 0, z = 3 on the ellipsoidal surface £ in (4.59)] that g, = x a and g 2 - 
x 2 he non-zero and non-parallel, a condition we can express as |g,||g 2 | > 
ISi ' S2I or 

g = dettg^ g^] > 0. (4.64) 


The Metric Tensor 


The differential of x is denoted and defined by 

dx = x a du a = g a dM a (4.65) 

and the first fundamental form of S is denoted and defined by 

J = dx dx = ga gp du*du f (4.66) 

The metric tensor on S is denoted and defined by the direct product 

G = g*g a . (4.67) 


where, in analogy with our definition of roof base vectors in Chapter II, the 
roof base vector g a is defined implicitly by 


g g K 




1 if a = ft 
0 if a / p 


(4.68) 


In terms of the cellar components of the surface metric tensor, 


Qat = ga • G gp = & • g„, (4.69) 

the first fundamental form reads 


I = g afi du a duf 


(4.70) 


Geodesics 

Let P and Q be two fixed but arbitrary points on S and let 

C:u“ = A a (t), a < t < b (4.71) 

be the parametric equations of some curve on S joining P and Q. Then, by 
(4.65) and (4.69) see also Exercise 3.12 —the distance along C is given by 
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A geodesic G PQ is a curve that minimizes the distance between P and Q. 
Finding G PQ is a problem in the Calculus of Variations and forms a vast 
chapter in the book on Differential Geometry. 

Rather than attempting to introduce enough of the Calculus of Variations 
to derive the differential equation for a geodesic, 6 we shall, instead, keeping 
physical applications in mind, give a direct alternative derivation of these 
equations. This derivation is based on the mechanical principal that, of all 
possible static positions of a greased string stretched between P and Q and 
forced to lie in the surface S, the actual position(s) is that which mimizes the 
length of the string. But, because the string is greased and at rest, the force at 
any point exerted by S, which holds the string in equilibrium, can have no 
component tangent to S. This fact gives us an alternative way to derive the 
following differential equation for G pq . 

For convenience, we replace the parameter t in (4.71) by arc length s so that 
along G pq we have the unit tangent 

t = x'(s) = x x du*/ds = g a T a . (4.73) 

The force exerted at any point in the string by the material to the right of this 
point can be represented as T(s)r(s), where T is the tension. If f(s) denotes the 
force per unit length exerted on the string by the surface, then the balance of 
forces on a differential element ds of the string requires that d(Tx) = f ds. That 
is, 

T't + Tt' = f. (4.74) 

But f can have no component tangent to S. Thus, in particular, r f = T ' + 

Tt • t' = 0. Because t is a unit vector, t • t' = 0; hence T' = 0, i.e., the tension 

is necessarily a constant. More generally, we must have g v • f = 0; hence, from 
(4.74) with r = 0, 

g’ t' = 0. (4.75) 

This purely geometric condition is the differential equation for our geodesic! 

To express (4.75) in component form, we differentiate (4.73) with respect to 
s and insert the result into (4.75) to obtain 

g y -(&,dT a /ds + g^tV) = 0. (4.76) 

Now g* g a = Sf Moreover, the expression g v -g, ,, should (faintly?) ring a 
bell it’s just the right side of (3.69) with Latin indices replaced by Greek 
ones. By adopting the definitions and machinery for Christoffel symbols de- 
veloped at the end of Chapter III, we have, by (3.69) and (3.71), 

r^g’-g.,! 

= 29 yi id tt x,p + Gfix.* - 9*p.x)> ( 4 - 77 ) 


b See, for example. Advanced Calculus for Applications , 2nd ed, Section 7.8, by Hildebrand 
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where g yX = g y ■ g 2 . Thus (4.76) takes the form 

dx'/ds + ri p x a T p = 0. (4.78) 

The definition (4.17) of the covariant derivative holds in any number of 
dimensions. Thus, replacing Latin indices by Greek ones and noting that 
dr f /ds = (dx y /du*)(du*/ds) = T y T®, we may re-write (4.78) as 

T a V a T y = 0. (4.79) 

When attempting to solve these two equations (y = 1, 2), we must remember 
that the price for using arc length as the independent variable is that we must 
enforce the constraint t t = g xfi x*x fi = 1 at every value of s. 

Now consider a small bead that slides without friction along the surface S 
under no external forces save those exerted by S to keep the bead from fly- 
ing off. With a prime (as above) denoting differentiation with respect to arc 
length s and a superior dot denoting differentiation with respect to time t (as 
in Chapter III), we have, by the chain rule, 

x = rs, x = t s + t's 2 . (4.80) 

But if there is no friction, the force exerted by S on the bead can have no 
component tangent to S. Thus, in particular, i-x = j = 0,so that the path 
acceleration of the bead is zero. More generally, g y, x = 0 which, because 
s = 0, reduces to g v - t' = 0, the geodesic equation (4.75). 

If the equation of the geodesic is expressed as u® = u *(t), then, to obtain a 
differential equation for u®, we first re-write (4.80) in the component form 

x = g.w®, x = g a ti® + g (4.81) 

Then g y • x = 0 implies that 

u' + ri,M = 0. (4.82) 

This is precisely of the same form as the acceleration of a particle in a set of 
3-dimensional curvilinear coordinates which appeared as the coefficient of 
the mass m on the right side of (3.68). But there is more; If we let the Greek 
indices range from 1 to 4, then (4.82) is precisely the differential equation that 
describes the motion the so-called world line — of a mass point in general 
relativity. 1 


The Gaussian Curvature: An Intrinsic Invariant 

Suppose that our ant lives on a surface S, such as a sphere, that cannot be 
unrolled into a plane. Without leaving S, can it deduce that its world is 
curved? The answer is yes if the ant knows the theorem we shall now develop. 

7 For a fascinaling account of Einstein's struggles to perfect the equations of general relativity 
see the biography Subtle is the Lord ... by Abraham Pais, 
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Consider any three points A, B, and C on S that comprise the vertices of a 
unique geodesic triangle, ABC ; i.e.. A, B and C can be connected by unique 
geodesics. (This is not always possible if A, B and C are too far apart. For 
example, if S is a sphere and two of the points are the north and south poles, 
then there are an infinity of geodesics of equal length, the meridians, connect- 
ing the poles.) So that our ant has a fixed background against which to 
measure how much it turns in its triangular promenade, we assume that there 
is a continuously differentiable unit vector field X tangent to and defined over 
S in some neighborhood of ABC. This vector field, which is in no way unique, 
is simply a useful auxiliary that will make no appearance in our final results. 
On the sides of ABC we set 

X = rcos 0 + vsin 0, (4.83) 

where v is an outward unit normal to ABC lying in S and ± to t and where 
cos 0 = X • t and sin 0 = X- v. Differentiating both sides of (4.83) with respect 
to arc length s, we have 

X' = (-rsinO + vcos0)0' + r'cosO + v'sinO, (4.84) 

Because t and v are unit vectors, r • r' = v • v' = 0. Moreover, because r v = 0 
and because the sides of ABC are geodesics, n • r' = - r- v' - 0. Thus, dotting 
both sides of (4.84) with the coefficient of 0', we have 

0' = (-rsinO + vcos0)-2' = [(x - - (t - A')/] v, (4.85) 

If we set X = X a g„, then, by steps analogous to those that took us from (4.75) 
to (4.79), we arrive at 

g fi -X' - x'V.X*. (4.86) 

Thus, noting that = 5jj - v*v p , we have 

(f i)A' • v = x,A V(V„A')v, = A'(V.A')v, - V(V„A')v, (4.87) 

and 

(t-X')X v = x f x*(S/ a X»)X'\ = (V,X*)X\ - vW'V'v (4.88) 
If these expressions are inserted into the right side of (4.85), then the last 
terms on the right of (4.87) and (4.88) cancel, leaving 

O' = (X x W a X fi - X*V.X m )v t . (4.89) 

Let arc length along the geodesic triangle ABC be measured counterclock- 
wise from A and let .s B denote the distance along the boundary of ABC from 
A to B, s c the distance from A to C, and s A the length of the perimeter of ABC. 
Then, integrating both sides of (4.89) around ABC, noting that the integra- 
tion of O' must be broken into three integrals because 0 has jump discon- 
tinuities at the vertices of ABC, and using the 2-dimensional version of the 
divergence theorem given by (4.43), namely 
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Js, 


(4.90) 


we obtain 

0 |o* + + 0 |st = { KdA ' (4.91) 

Here s* = lim(s ± |e|) as c -> 0, and 


K a - A'V.A 01 ) 

= A^Va/L'’ - A%V a A«, 


(4.92) 


the term V /) A I 'V 0 ,A' i — V^A^V.A* = det(V /( A“) being zero because |A| = 1, Sec 
Exercise 4.26. Now the left side of (4.91) is independent of the coordinate 
system on S and the unit vector field A. Therefore, despite appearances, the 
same must be true of the right side. Moreover, as the various calculations 
leading to (4,91) depended only on vectors tangent to S and the Christoffel 
symbols (which are defined in terms of derivatives of the components of the 
metric tensor), we conclude that K, the Gaussian curvature, is an intrinsic 
invariant. Unfortunately, it takes some mathematical gymnastics to derive an 
intrinsic formula for K directly from (4.92). Instead, we shall soon take an 
indirect route to such a formula via some extrinsic differential geometry, but 
along the way we shall encounter many interesting things including the ori- 
gin of the nature “curvature”. 

Suppose that our peripatetic ant, crawling around ABC keeping the inte- 
rior of ABC to its left, measures the following change in its orientation at each 
vertex of ABC. 


A 0 B s 6(st ) - 0(s B ), A Of = G(s£ ) - 0(s ( ), A6 a s 2n + 0(0") - 0(s A ), 

(4.93) 


Substituting these numbers into (4.91), the ant obtains the following special 
case of the celebrated Gauss Bonnet Theorem : 8 


2 ji - (A0 a + A 0 B + A0 C ) = 


(4.94) 


If the left side of this expression is not zero, the ant concludes that its habitat, 
the geodesic triangle ABC, cannot be unrolled into a plane. Why? Because 
otherwise one could introduce a set of rectangular coordinates u l = x and 
u 2 = y such the first fundamental form of the surface reduced to / = dx 2 + 
dy 2 . In such a case all the Christoffel symbols arc zero and hence V /( A a = A’, 
which, by the second line of (4.92), yields K = 0. 


s See Differential Geo, 


K 2nd ed, by Struik. 
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Some Extrinsic Geometry 


The examples we have considered in this section involving a surface S — a 
stretched string, the frictionless motion of a bead, the change in orientation 
of an ant crawling around a geodesic triangle have all required that we 
study how vectors change along curves on S. Our focus has been intrinsic in 
the sense that our vectors have had no components normal to S and, more- 
over, only the in-surface components of their changes have been examined. 
We now expand our outlook and consider how an arbitrary Euclidean vector 
v defined on S changes as we move over S. 

In 3-dimensional Euclidean space, E 3 , any vector may be represented in 
the basis {g,}. However, the two surface basis vectors g„ do not span E 3 so 
that, if we wish to use them to represent an arbitrary vector v, we must aug- 
ment them with a third independent vector n. The most obvious choice for n 
is one of the two unit normal vectors to S at a point with coordinates u a . (Of 
course, for n to exist at such a point, S must be smooth and not have a ridge 
or be cone-like there.) For definiteness we choose 


„ _ gl X g 2 
lg| X g 2 | 


g 1 X g 2 

Ig 1 x g 2 l 


= K , t x g'. 


(4.95) 


Here sec the analogous 3-dimensional definition (2.13)- 


f Jg if a =1,0 = 2 
<w= j -sfg if a = 2,0= 1 
[ 0 if a = 0 = 0 


(4.96) 


are the cellar components of the surface permutation tensor d> Thus, we may 
represent any vector v in the form 

v = i-^g, + an. (4.97) 

Assuming that everything in sight is differentiable, we find from this expres- 
sion that the rate of change of v along the u^-coordinate curve is 

y.i, = v^g. + v. „n + v*g„„ + vn „ (4.98) 

Just as we saw in Chapter III when we computed the acceleration v of a 
particle in a curvilinear coordinate system, so we see here that the change in 
v has two parts: the change in the components of v one would see if the frame 
}g a , n} were momentarily frozen- the first part of (4.98) and the change 
produced by the change in the frame itself— the second part of (4.98). In 3 
dimensions we were able to set g t J = rfj g k . However, because the g^ do not 
span E 3 , we have instead Gauss’ equations 

gz.fi = ri f g y + b tf n or V„g„ = b xfi n. 

The Christoffel symbols are defined by (4.77) and 


(4.99) 
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b 7f i - b fx = n x xf = n • p = ■ n p ■ g„. (4. 100) 

Incidentally, because n n = 1 and hence, n f n = 0, (4.100) tells us that n 
changes along a coordinate curve according to Weingarten's equation : 

V /( n = n /( = -b af g“ or B = Vn = b afi gV- (4.101) 

Here, V = g’V, and B is the symmetric curvature tensor of S. The adjective 
“curvature” means that, if we fix a point P* on S with normal n + and pass a 
plane through n + , then the intersection of this plane with S will be a plane 
curve C with curvature r B t at P*, where r is a unit tangent to C. See 
Exercise 4.27. 

In terms of the surface covariant derivative operator we may, with the 
aid of (4.99), express (4.98) in the form 

V = (V* - bf t v) g, + (v_ fi + v%p)n. (4. 102) 

(Note that B - B 7 implies that = g* r b y „ = g a> bl = b$. That is, as we 
noted in Chapter II, the two possible mixed components of a symmetric 
2nd-order tensor arc identical.) Further, applying the operator V to v yields, 
by (4.102), the 2nd-order tensor 

Vv = (VpV 7 - bjjv) g^g, + (v,, + v'b^n. (4.103) 

Upon contraction, we have 

V» = V/-2Hr, (4,104) 

where H = \K > s the mean curvature of S. If v is tangent to S, then v = 0 and 
V ■ v = V a iA which is sometimes called the surface divergence of v*. If v is the 
surface gradient of a scalar <j>, i.e., if v = = g x <f> ^ then (4. 104) yields 

= g^V'+jS A^, (4.105) 

which is called the surface Laplacian (or Beltrami's second differential ) of <j>. 


The Gaussian Curvature Revisited 

We end this section and the book by showing how the Gaussian curvature K 
can be expressed in terms of the curvature tensor B or, alternatively, in 
terms of the components of the metric tensor G and its derivatives (up to the 
second order), thus establishing Gauss’ famous Theorema egregium. Warning: 
In what follows, there is a plethora of indices and the dummies will be re- 
named without comment. Caveat emptorl 

Let be an arbitrary, smooth closed curve on S, parameterized by arc 
length s and enclosing a region R on S. Let r = t*g a be a unit tangent to dR 
and let v = r x n = be an outward unit normal to dR so that t* = 

Then, by (4.101), the divergence theorem (4.90), and the fact that, as in 3 
dimensions, the components of the permutation tensor are covariantly con- 
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stant Exercise 4,28 — we have 

0 = J" n ' ds 
= -J fe J} ,g y t a ds 

= J” c af b£% y Vpds 

= J (4.106) 

But SR (and therefore R) is arbitrary. Hence, assuming that the integrand in 
the last line of (4. 106) is continuous and noting (4.99), we conclude that 

'^[(VA'dg,, + *>«%"] = 0. (4.107) 

Now the coefficient of n is symmetric in a and ff while c af is anti-symme- 
tric. Moreover, an equation of the form w y g y = 0 implies that w y = 0. Thus, 
(4.107) reduces to the Codazzi equations 

e*%bi = 0. (4.108) 

Finally, let us return to our arbitrary unit vector field A = Integrating 
X around SR, we have, with the aid of the divergence theorem and (4.99), 

0 = | l' ds 



= J e*[(V,V.A 5 ' - A'6„ft/)g 7 + (b. /fi V x V + b n V„P + dA. 

But, by symmetry in a and fi and by the Codazzi equations (4.108), the 
coefficient of n in the last line of (4.109) vanishes. Thus, assuming that the 
coefficient of g r is continuous within R, it follows, because R is arbitrary, that 

- fb^bjf) = 0. (4. 1 10) 

Multiplying both sides of this equation by summing on repeated in- 
dices (as always), and noting that = — e*“, we have 

(4. 1 1 1) 

Referring to equation (4. 143) of Exercise 4.26, we see that, because | A| = 1, the 
left side of (4.111) is equivalent to our initial expression for the Gaussian 
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curvature, (4.92). We also see by equation (4.141) of Exercise 4.25 that the 
right side of (4. Ill) reduces to det[££]. But the bp are just the mixed compo- 
nents of the symmetric curvature tensor B. Thus, 

K = det B (4.112) 

Our final task is to deliver the promised intrinsic formula for the Gaussian 
curvature. Following the beaten path — no heresy here we observe that if 
x(u 3 ) has continuous third partial derivatives, then \ afJ = g apy = g x yfi , which 
is equivalent to the compatibility condition 

e' y g..„ = 0. (4.113) 

Using the equations of Gauss and Weingarten, (4.99) and (4.101), we expand 
the compatibility condition into 

+ Kp n), y 

= - b„pb.f) g A + (^ y + r^jn] = o. (4.i 14) 

The coefficient of n vanishes because of the Codazzi equation, (4.108). The 
vanishing of the coefficient of g* yields 

c lh (V^ y + r^r; y ) = c fr b a pby. (4. 1 1 5) 

To bring K into the act, note that equation (4. 1 40) of Exercise 4.25 implies 

that rJE'b^b* = g a)l e,' a K. Thus, multiplying both sides of (4.115) by 33 “%^, 
we reach our goal— an intrinsic formula for the Gaussian curvature: 

k = r^), (4. 1 1 6) 

a formula upon which many variations may be rung and wrung . 9 


Exercises 


4.1. If / = xyZyfx 1 + y 2 , compute dffds at (1, - 2, 3) in the direction of v ~ (2, 1, 2). 

4.2. The mean value theorem: A region is said to be convex if a straight line joining 
any two points in the region lies wholly within the region. (Compare Figs. 4.6a 
and 4.6b) 

(a) . If P(a) and Q(b) are points in a convex region R, show that 

S:x=(l — f)a + tb, 0 <: t £ 1 

is a parametric equation of the line segment S joining P and Q. 

(b) . Iff is a differentiable function of position in R, show that 

/(b)-/(a) = V/(xJ(b a), 
where x„ is the position of some point on S. 


See. Struik. op, cit. 
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(a) convex 


(b) non-convex 


Figure 4.6. 


Hint: On S, /(J?(t)) = f(t). Applying the ordinary mean value theorem of 
calculus to /. 

4.3. If / = xy + yz + zx, compute the cellar components of V/ in 

(a) , spherical coordinates (3.42) 

(b) . the oblique Cartesian coordinates 


4.4. Just as the action of a dyad uv on a vector w may be denoted and defined by 
uv w = u(v • w), so the action of a vector w on the dyad uv may be denoted and 
defined by ruv = (wu)v. By setting uv = u'v k g,g*, establish the identity 

V • (uv) = (V • u)v + (u • V)v. 

4.5. (a). Wc were led to the definition of the covariant derivative of the roof compo- 

nents of a vector field $(u') by setting v f = (t J gy).j = Vji’% and carrying out 
the indicated partial differentiation. Likewise, wc may define the covariant 
derivatives of the roof components of a tensor field T(u J ) by setting T = 
(T Jk gjg k ) i = Vj T Jk gjgir Show that 

v, r Jk = rf + r^, r pk + r‘ ( r lp (4. 1 1 1) 

and hence that the divergence of a tensor field takes the component form 
V • T = g‘T, = Vy T ik g k . 

(b). Assuming for simplicity that T = T 1 and setting T = T k J gjg k — T Jk g J g k . pro- 
ceed as in part (a) to show that 

v, T k J = T k J , + r>, Tf - rf* T‘ (4. 1 1 8) 

and 

V, T jk = T Jk i - rf, T pk - Tf* T JP . (4.119) 

4.6. In continuum mechanics, the linearized equations of motion of a body may be 
written 

V-T+f=pii, T = T r , (4.120) 


where T is the stress tensor (see Exercise 1.20), f is the body force vector per unit 
volume and u is the particle displacement. 



Exercises 


99 


(a) . Write out these equations in component form in Cartesian coordinates, 

where 

T = T xx e x e x + T xy e x e y + ■•• , f = f x e x + , and u = u x e x + ■ ■ 

(b) Do the same in circular cylindrical coordinates, where 

T - T" g,g r + T'°g r go + ■ , f = f'g, + , and u = u'g, + 

(c) . Express the roof components of T in (b) in terms of its physical components, 

T ( "\ 7' (r<> , etc. 

4.7. Show that V- (pi) = Vp. 

4.8. The equations of motion of an ideal gas may be expressed in the coordinate-free 


pv = pv , -(- p(v • V)v = — Vp, 

where t is' time, p is the density, v is the velocity and p is the pressure. With the 
aid of (4.39) and Exercises 4.4 and 4.7, show that we may cast this equation into 
the conservation form 

(/>v) , + V(pvv + pi) = 0. 

(This form is useful in numerical work. The term pvv will be recognized from 
Exercise 1.19 as the momentum flux tensor.) 

4.9. Establish the identities 

(a) . V(uv) = (Vu)-v +(Vv)-u 

(b) . V • ( /v) = (V/) ■ v + /V ■ v. 

<c). V-(Tc) = c(V-T) 

where T is a second order tensor field and c is any constant vector. 

4. 10. Use 4.9(b) and (4.37) to show that the equation of conservation of mass can be 
rewritten in the form p + pV • v — 0. (This equation gives us two equivalent 
ways to define an incompressible fluid, viz. p = 0 or V • v = 0. The second may be 
preferable in that it is strictly a kinematic condition, i.e. it does not involve a 
material property of the fluid.) 

4. 1 1 The LapUician , denoted and defined by V 2 = V V, is one of the most pervasive 
operators in physics and continuum mechanics. 

(a) . Show that V 2 / = y ,J V,f \ r 

(b) . Compute V 2 / in Cartesian, oblique Cartesian, circular cylindrical, and 

spherical coordinates. 

4 12. Establish the identities 

(a). V (Vu) = V 2 u, (b). V (Vu) T = V(V ■ u), (c). tr(¥u) = V u. 

4. 1 3. The linear field equations for elastically homogeneous and isotropic bodies con- 
sist of the equations of motion 4.120, the stress-strain relations 


T = xtr(E)l + 2pE, 


(4,121) 
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and the strain-displacement equations 

E = |[Vu + (Vu) 1 ]- (4.122) 

In (4.121), a and p arc constants called the Lame coefficents. By substituting 
(4.122) into (4.121) and the resulting expression into (4.120), and noting Exer- 
cises 4.4 and 4.9, show that 

(A + |i)V(V u) + //V 2 u + f = pii. 

These are called the Navier equations. 

4.14. Assuming that <j> and v are differentiable, show that 

(a) . V-(^v) s \<t> y + <f>\ v. 

(b) . V x (^v) s V<t> x v + x v. 

(c) . V(^v) = S(j>\ + </>V\. 

4.15. (See Exercise 1.23). The trace of a 2nd order tensor T is denoted and defined by 

trT= TJg'gj = T,‘. 

(a) . Show that 

trT = 7:| = g<% = g^VK 

(b) . Show that trT is an invariant, i.e., that T- = f/ 1 under the change of coordi- 

nates u j = t) J (u k ). 

(Equating a roof and cellar index of the mixed components of a tensor produces 
the components of a new object called a contraction of the tensor. Thus, in (b), 
the scalar trT is the contraction of T. If U = l/.V* &gjg\ then the vectors U‘.i t gj 
and UHjgi are both contractions of U.) 

4.16. Derive (4.20) by setting v , = (^g-'j.i and using (3.104). 

4.17. Use (3.71), Exercise 2.10, and the following observations to establish (4.24): 

(a) . If det[0,j] is expanded about the row (or column) containing the element 

Qij, the coefficient of g,j is J 2 g‘ i . This follows by applying (3.70) to the matrix 

G 2 . 

(b) . If a determinant is regarded as a function of its elements, then, by the chain 

odet[0„] dgu 

-du*- = J9 ft?' 

4.18. Compute V 2 i j/ in parabolic-, elliptic-, and bipolar cylindrical coordinates, as 
defined in parts (/), (g), (h) of Exercise 3.17. 

4.19. A Rigid Body Motion y = r(x,t) preserves distance between particles, i.e. 

|r(x,t) - r({,t)| = |x - a -oo < t < qo, x, £eS(0). 

Show that any such motion must be of the form 

r(x, t) = c (t) + Q(t)x, c(0) = 0, Q(0) = 1, 


(4.123) 
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where c is an arbitrary time-dependent vector and Q is an arbitrary time-depen- 
dent rotator (see Exercise 2.16). 

Hints: 

(a) . Set r(x, t) = r(0, t ) + g(x, r) and note that r(0, 0) = g(0, t) = 0 and g(x, 0) = x. 

(b) . Show that |g(ax, t)| = |a||x|. 

(c) . Use (1.11) to show that g preserves dot products, i.e., 

i(*,t)-gK,() = rUW 

(d) . Show that g is linear by showing that |g(x + & 0 — g(x, 0 — g(£, t)l 2 = 0- 

(e) . Rename r(0, t) and g(x, t) to arrive at (4. 1 23). 

4.20. The velocity in rigid body motion: Starting from (4.123), show that 

y = c(t) + e>(t) x (y - c(t)). (4. 1 24) 

Hint: Differentiate (4. 1 23) with respect to time, express x in terms of y and c, and 
differentiate both sides of Q • Q r = 1 to show that Q Q T is skew and that o is 
its axis. (Sec Exercise 1.18a.) 

4.21 The gross equations of motion of a body, mentioned in the introduction to 
Chapter III, arc obtained as follows. 

Let p(x) denote the mass density of a body B in its initial shape S(0) = S a . By 
definition, the mass of B is 

M = J pdV. (4.125) 

Let (4.30) describe the motion of B in an inertial frame with origin O. The 
linear momentum of B is then defined by 

L = J pydK = (J pydV^J, (4.126) 

and its rotational momentum with respect to O by 

R 0 = J" pyx.ydV. (4.127) 

The center of mass of B is the vector 

X = M~ l J pydV. (4.128) 

(a). Show that if we set 

y = X + z, (4.129) 

then 

J pzdV = 0 (4.130) 

and 

R„ = MX x X + pz x zdV. 

J s 0 


(4.131) 
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(b). Show that (3.4) reduces to 

T = Q pz x zdv'j s= R c . (4.132) 

R c is called the rotational momentum about the center of mass of B, 

4.22. The moment of inertia tensor appears when we compute the rotational momen- 
tum of a rigid body, i.e., a body capable of undergoing rigid motions only. 

(a) . Let the origin of the inertial frame coincide with the center of mass of the 

body at t = 0. With reference to Exercise 4.19 and (4.128), show that 

X = c, z = Qx, z = m x z. 

(b) . Show that z x z- Q • [(x x) I - xx] - Q 7 to and hence, since Q is a function 

of t only, that 

R c = Q I Q T n>, (4.133) 

where 

I = | p(x)[(x x)l - xx]dV (4.134) 

is the moment of inertia tensor. 

If p is a constant, compute the Cartesian components of I for 

(c) . A sphere of radius a. 

(d) . A rectangle parallipiped of dimension a x b x c with the alignment of axes 

shown in Fig. 4.7. 

4.23. Recalling that in the preceding exercise 

Q Q r = oix, (4.135) 

show that 

Q. r Q = (Q T o») x . (4.136) 

Hint: In order, show that Q T o> is an eigenvector of Q 7 CL that the representation 



Figure 4.7. 
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(4 136) is unique, and that, from Exercise 1.26, 

|Q 7 «>| 2 = |«| 2 = tr Q" T Q = tr Q • Q T . 

4.24. Euler's equations of motion for a rigid body are 

T* = to* x I - to* -(- | • m*. (4.137) 

Show that (4.137) follows from (4.132), (4.133) and (4.136) upon setting 

a) = Q.(o*, Rj. = QR*, T = QT*. (4.138) 

(Physically, these equations may be regarded as a change from an inertial frame 
to a non-inertial, body-fixed frame. See the comments at the beginning of Chap- 
ter 1 and Goldstein, op. cit.) 

4.25. (a). If A = [/!$] is a 2 x 2 matrix, show that the analogue of (2.48) is 

r.^e^det A = - A* A?. (4.139) 

(b). Show that (4,139) implies 

(.^AeiA=r. Xl ‘AlA» (4.140) 

and hence that 

det/4 = (l/2)e, / ,e 2 M‘[/4j!. (4.141) 

4.26. Use the 2-dimensional versions of (4.24) and (4.1 19) to show that V y e, # = 0, i.e., 
that the cellar components of the 2-dimensional permutation tensor are covari- 
antly constant. Also show from (4.95) that e aP = (g, x g„) n and hence that 
V 7 b 1(i = 0 follows from this expression with the aid of (4.99) and (4.101). 

4.27. (a). Using the 2-dimensional analogue of the identity (2.18), show that 

- A'V.A* = (4. 142) 

(b). Take the covariant derivative of (4.142) with respect to u f and use (4. 141) to 
show that 

V,(;.*V,A' - ;/V a A«) = + 2 det(V (J /. , J. (4. 143) 

Note that if A, A” = 1, then = 0 which implies that det(V /( 2“) = 0. 

Why? 

4.28. Let a plane intersect a surface S in a curve C. If r is a unit tangent to C\ show 
that the curvature at any point of C is given by t - B ■ r, where B is the curvature 

4.29. Let .S' be a surface of revolution given in the vector parametric form 

S: x = r(s)e r (0) + z(s)e z , a < s < b, 
where s is arc length along a meridian and 

e r = e x cos 0 + e y sin 0. 

The cellar base vectors may be represented as 

gi = g s = e,(0) cos a(s) + e z sin a(s) and g 2 = Se = r(s)e # (0). 
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cos a = r'(s), sin a = z'(s) 


and 


e„ = - e x sin 0 + e y cos 0. 

In the following, leave your answers in terms of r, a and (if appropriate) deriva- 
tives of a. 

(a) . Find n, the unit normal to S. 

(b) . By computing g* s , g, and g^, and expressing your results in the form 

g„., = n 5 g, + + b„ n, etc,, 

read off the Christoffel symbols and the cellar components of the curvature 
tensor B. 

(c) . Compute the mean and Gaussian curvatures of S. 

(d) . What is the expanded form of the Codazzi equations, (4.108)? 

4.30. Show that the solution for the differential equation for a geodesic oil a surface 
of revolution can be reduced to quadrature. 

4.3 1 . The 2-dimensional analogue of a string is a membrane , a 2-dimensional material 
continuum that can support tangential forces only. We assume that the mem- 
brane is in static equilibrium under external pressure and — if the membrane is 
not closed like a ballon— boundary forces. Modeling the membrane as a surface 
S, let dR be an arbitrary, piecewise smooth curve on S enclosing a region R and 
let N ds denote the force exerted by the material outside of 3R across a differen- 
tial element of arc length. Finally, let pdA denote the external force acting on a 
differential element of area of S. Then, force and moment equilibrium of the 
material comprising R require that 

pdA = 0 and J xxNds + Jxx pdA = 0, (4.144! 

where x is the position of points on S from a fixed origin O. 

(a) . Let v be an outward unit normal to dR, tangent to S. Introduce a set 
of Gaussian coordinates u a on S and apply (4.144), to a triangle whose 
sides consist of two differential elements of the coordinates curves of length 
y/ffudu' and ^/g^ 2 du 2 , respectively, intersecting at an arbitrary point on 
S, and a third side of length ds with outward normal v = v’g, = v^g*. As- 
suming that N = N(m < i , v), show that 

N = Ny)v' = NV)v„ (4,145) 

where 

N, =N(uV/V0 n )/v^ 1 ' and N 2 = N(h' gVF'Vv'S 55 - (4.146) 
Hint: First show that N(i/, v) = — Nlu^, — v). Then note that g'A/7'isa 
unit normal to the u 2 -coordinate curve and, by projection, -Jgudu 1 - 
v lg'/vV'). etc. 

(b) . Substitute (4.145) into (4.144),, apply the divergence theorem, and invoke 
the arbitrariness of the region R and continuity to derive the vector differen- 


1 N ds + J 
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tial equation of membrane force equilibrium, 

V.N* + p = 0. (4147) 

Likewise, show that if we substitute (4,145) into (4,144) 2 and use (4.147), we 
obtain the vector algebraic equation of membrane moment equilibrium, 

g,xN’ = 0. (4.148) 

(c). Because N and hence N” is tangent to S at every point, we may set 

N" = N af fy. (4.149) 

Representing the external pressure as 

p = p 1 ga + pn, (4.150) 

show that (4.147) and (4.148) take the component forms 

V.AP" + p* = 0, b^N*" + p = 0, = N f *. (4.151) 

4.32. A soap film is a membrane that can support only a constant normal tension T 0 
and an external normal pressure p, i.e., in a soap film, = T 0 g tfi and p* = 0. 
Show that in this case (4.151) I 3 are satisfied identically while (4. 1 5 1 ) 2 reduces to 

H=-2p/T 0 , (4.152) 

where H is the mean curvature of the film. (Of all surfaces spanning a simple, 
piecewise smooth space curve dR, a minimal surface has the least area. If a wire 
has the shape of dR, then a pressure-free soap film which spans this wire will 
adjust itself to minimize its energy which is proportional to its area. Thus, using 
a mechanical principle as we did to relate a tightly stretched greased string on a 
surface to a geodesic, we conclude from (4.152) that a minimal surface has zero 
mean curvature.) 

4.33. Suppose that a surface can be specified in the form 

S:z = i(x,y), (x, y) e D. 

(a) . Compute the cellar base vectors g, = g* and g 2 = g y , the Christofiel sym- 

bols, r} ! = r^, etc., and the cellar components of the curvature tensor, 
f>i i = b xx , etc. 

(b) . Compute the mean and Gaussian curvatures, H and K. 

(c) . Set H — 0, thus obtaining Lagrange's equation for a minimal surface, 

(1 + zf y )z_ xx - 2z x z y z xy + (1 + z 2 x )z n = 0. (4.1 53) 

4.34. Show that if we set 

K = fra/ig" x n + (l/4)(e ty g 0 ■ g,)n, (4. 1 54) 

then the Gauss and Weingarten equations of surface theory, (4.99) and (4.101) 
may be written in the form 

g,.* = + K x R, (4.155) 

and 


n , = k, x n 


(4,156) 
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